Genetically fused globin-like polypeptides having hemoglobin-like activity

ABSTRACT

The alpha subunits of hemoglobin, which in nature are formed as separate polypeptide chains which bind noncovalently to the beta subunits, are here provided in the form of the novel molecule di-alpha globin, a single polypeptide chain defined by connecting the two alpha subunits either directly via peptide bond or indirectly by a flexible amino acid or peptide linker. Di-alpha globin may be combined in vivo or in vitro with beta globin and heme to form hemoglobin. Di-alpha globin is expressed by recombinant DNA techniques. Di-beta globin may be similarly obtained.

This is a division of application Ser. No. 07/789,179 filed Nov. 8, 1991, now U.S. Pat. No. 5,545,727, which is a CIP of Ser. No. 07/671,707 filed Apr. 1, 1991, now abandoned, which is a CIP of PCT/US90/02654 filed May 10, 1990, now abandoned, which is a CIP of (a) Ser. No. 07/374,161 filed Jun. 30, 1989, now abandoned, (b) Ser. No. 07/379,116 filed Jul. 13, 1989, now abandoned, and (c) Ser. No. 07/349,623 filed May 10, 1989, now abandoned, all hereby incorporated by reference herein.

CROSS REFERENCE TO RELATED APPLICATIONS

Hoffman and Nagai, U.S. Ser. No. 07/194,338, filed May 10, 1988, now U.S. Pat. No. 5,028,588, presently owned by Somatogen, Inc., relates to the use of low oxygen affinity and other mutant hemoglobins as blood substitutes, and to the expression of alpha and beta globin in nonerythroid cells. Hoffman and Nagai, U.S. Ser. No. 07/443,950, filed Dec. 1, 1989, discloses certain additional dicysteine hemoglobin mutants; it is a continuation-in-part of 07/194,338. Anderson, et al., HEMOGLOBINS AS DRUG DELIVERY AGENTS Atty. Docket.: ANDERSON5-USA, filed Nov. 8, 1991, discloses use of conjugation of hemoglobins with drugs as a means for delivery of the drug to a patient.

FIELD OF THE INVENTION

The present invention relates to the intracellular assembly of a hemoglobin-like protein in biologically functional, substantially soluble form through co-expression of alpha- and beta-globin-like polypeptides in bacterial or yeast cells.

It further relates to the genetic cross-linking of the two alpha subunits of hemoglobin to form a novel polypeptide, di-alpha globin, which may be considered a partially assembled intermediate leading to a hemoglobin-like protein, and the use of this compound in the production of synthetic hemoglobins having an increased intravascular half-life as compared to stroma-free hemoglobins. It also relates to the analogous polypeptide di-beta globin.

INFORMATION DISCLOSURE STATEMENT

It is not always practical to transfuse a patient with donated blood. In these situations, use of a red blood cell substitute is desirable. The product must effectively transport O₂, just as do red blood cells. ("Plasma expanders", such as dextran and albumin, do not transport oxygen.) The two types of substitutes that have been studied most extensively are hemoglobin solutions and fluorocarbon emulsions.

A. Structure and Function of Hemoglobin

Hemoglobin (Hgb) is the oxygen-carrying component of blood. Hemoglobin circulates through the bloodstream inside small enucleate cells called erythrocytes (red blood cells). Hemoglobin is a protein constructed from four associated polypeptide chains, and bearing prosthetic groups known as hemes. The erythrocyte helps maintain hemoglobin in its reduced, functional form. The heme iron atom is susceptible to oxidation, but may be reduced again by one of two enzyme systems within the erythrocyte, the cytochrome b₅ and glutathione reduction systems.

About 92% of the normal adult human hemolysate is Hgb A (designated alpha2 beta2, because it comprises two alpha and two beta chains). The alpha chain consists of 141 amino acids. The iron atom of the heme (ferroprotoporphyrin IX) group is bound covalently to the imidazole of His 87 (the "proximal histidine"). The beta chain is 146 residues long and heme is bound to it at His 92. Apohemoglobin is the heme-free analogue of hemoglobin; it exists predominantly as the αβ-globin dimer.

Separated, heme-free, alpha-and beta-globins have been prepared from the heme-containing alpha and beta subunits of hemoglobin. The separated heme-free globin chains are folded very differently, even though the heme-containing subunits are highly similar in secondary structure and basic folding features. This shows that the binding of the prosthetic heme group to globin subunits has quite different effects on alpha and beta globin. Yip, et al., J. Biol. Chem., 247: 7237-44 (1972).

Native human hemoglobin has been fully reconstituted from separated heme-free globin alpha and beta globin and hemin. Preferably, heme is first added to the alpha-globin subunit. The heme-bound alpha globin is then complexed to the heme-free beta subunit. Finally, heme is added to the half-filled globin dimer, and tetrameric hemoglobin is obtained. Yip, et al., PNAS (USA), 74: 64-68 (1977).

In cell-free systems prepared from unfractionated rabbit reticulocyte hemolysates, globin is actively synthesized for approximately five minutes, and then protein synthesis abruptly ceases. Prior addition of hemin prevents or delays the cessation of synthetic activity, as a result of the effect of hemin on an inhibitory protein known as "hemin-regulated inhibitor" (HRI). Hemin deficiency has a more severe effect on alpha chain synthesis than on beta chain synthesis as alpha globin mRNA is less efficient than beta-globin mRNA in initiating polypeptide chain synthesis. It has been speculated that alpha chains are released from their site of synthesis only in the presence of free beta chains, which immediately complex the released alpha chains to form αβ-globin dimers. These then combine with heme to form tetrameric hemoglobin. Winterhalter and Huehns, J. Biol. Chem., 239: 3699 (1964). It is certainly known that the addition of heme to αβ-globin dimers (apohemoglobin) leads to the rapid formation of hemoglobin.

The human alpha and beta globin genes reside on chromosomes 16 and 11, respectively. Bunn and Forget, Hemoglobin: Molecular, Genetic and Clinical Aspects (W. B. Saunders Co., Philadelphia Pa. 1986). Both genes have been cloned and sequenced, Liebhaber, et al., PNAS 77: 7054-58 (1980) (alpha-globin genomic DNA); Marotta, et al., J. Biol. Chem.; 252 540-53 (1977) (beta globin cDNA); Lawn, et al., Cell, 21:647 (1980) (beta globin genomic DNA).

Hemoglobin exhibits cooperative binding of oxygen by the four subunits of the hemoglobin molecule (two alpha-globins and two beta-globins in the case of Hgb A), and this cooperativity greatly facilitates efficient oxygen transport. Cooperativity, achieved by the so-called heme-heme interaction, allows hemoglobin to vary its affinity for oxygen. Hemoglobin reversibly binds up to four moles of oxygen per mole of Hgb. oxygen-carrying compounds are frequently compared by means of a device known as an oxygen dissociation curve. This curve is obtained when, for a given oxygen carrier, oxygen saturation or content is graphed against the partial pressure of oxygen. For Hgb, the percentage of saturation increases with partial pressure according to a sigmoid relationship. The P₅₀ is the partial pressure at which the oxygen-carrying solution is half saturated with oxygen. It is thus a measure of oxygen-binding affinity; the higher the P₅₀ , the more loosely the oxygen is held.

When the oxygen dissociation curve of an oxygen-carrying solution is such that the P₅₀ is less than that for whole blood, it is said to be "left-shifted." The oxygen affinity of hemoglobin is lowered by the presence of 2,3-diphosphoglycerate (2,3-DPG), chloride ions and hydrogen ions. Respiring tissue releases carbon dioxide into the blood and lowers its pH (i.e. increases the hydrogen ion concentration), thereby causing oxygen to dissociate from hemoglobin and allowing it to diffuse into individual cells.

The ability of hemoglobin to alter its oxygen affinity, increasing the efficiency of oxygen transport around the body, is dependent on the presence of the metabolite 2,3-DPG. Inside the erythrocyte 2,3-DPG is present at a concentration nearly as great as that of hemoglobin itself. In the absence of 2,3-DPG "conventional" hemoglobin binds oxygen very tightly and would release little oxygen to respiring tissue.

Aging erythrocytes release small amounts of free hemoglobin into the blood plasma where it is rapidly bound by the scavenging protein haptoglobin. The hemoglobin-haptoglobin complex is removed from the blood and degraded by the spleen and liver.

B. Blood Substitutes, Generally

It is clear from the above considerations that free native hemoglobin A, injected directly into the bloodstream, would. not support efficient oxygen transport about the body. The essential allosteric regulator 2,3-DPG is not present in sufficient concentration in the plasma to allow hemoglobin to release much oxygen at venous oxygen tension.

Nonetheless, solutions of conventional hemoglobin have been used as RBC substitutes. The classic method of preparing hemoglobin solutions employs outdated blood. The red cells are lysed and cellular debris is removed, leaving what is hopefully "stromal-free hemoglobin" (SFH).

Several basic problems have been observed with this approach. The solution must be freed of any toxic components of the red cell membrane without resorting to cumbersome and tedious procedures which would discourage large-scale production. DeVenuto, "Appraisal of Hemoglobin Solution as a Blood Substitute", Surgery Gynecology and Obstetrics, 149: 417-436 (1979).

Second, as expected, such solutions are "left-shifted" (lower P₅₀) as compared to whole blood. Gould, et al., "The Development of Polymerized Pyridoxylated Hemoglobin Solution as a Red Cell Substitute", Ann. Emerg. Med. 15: 1416-1419 (Dec. 3, 1986). As a result, the oxygen affinity is too high to unload enough oxygen into the tissues. Benesch and Benesch, Biochem. Biophys. Res. Comm., 26:162-167 (1967).

Third, SFH has only a limited half-life in the circulatory system. This is because oxy Hgb partially dissociates into a dimer (αβ) that is rapidly cleared from the blood by glomerular filtration and binding to circulating haptoglobulin. If large amounts of soluble hemoglobin are introduced into the circulation, glomerular filtration of the dimers may lead to a protein and iron load on the kidney capable of causing renal damage. Bunn, H.F., et al. (1969) The renal handling of hemoglobin I. Glomerular filtration. J. Exp. Med. .129:909-923; Bunn, H. F., and J. H. Jandl; (1969) The renal handling of hemoglobin II. Catabolism. J. Exp. Med. 129:925-934; Lee, R. L., et al. (1989) Ultrapure, stroma-free, polymerized bovine hemoglobin solution: Evaluation of renal toxicity (blood substitutes) J. Surgical Res. 47:407-411; Feola, M., et al. (1990) Nephrotoxicity of hemoglobin solutions. Biomat. Art. Cell Art. Org., 18(2):233-249; Tam, S. C. and J. T. F. Wong (1988) Impairment of renal function by stroma-free hemoglobin in rats. J. Lab. Clin. Med. 111:189-193.

Finally, SFH has a high colloid osmotic pressure (COD). Thus, administration of SFH in a dose that would have the same oxygen-carrying capacity as a unit of packed red blood cells is inadvisable, since the high osmotic pressure (60 mm Hg) would cause a massive influx of water from the cells into the bloodstream, thus dehydrating the patient's tissues. This consideration limits the dose of SFH to that which provide a final concentration of about 6-8 gm Hgb/dl.

In an effort to restore the desired P₅₀ 0, researchers added 2,3-DPG to the hemoglobin solution. Unfortunately, 2,3-DPG was rapidly eliminated from the circulation. Scientists then turned to other organic phosphates, particularly pyridoxal phosphate. Like 2,3-DPG, these compounds stabilized the "T state" of the Hgb by forming a salt bridge between the N-termini of the two beta chains. The pyridoxylated hemoglobin had a P₅₀ of 20-22 torr, as compared to 10 torr for SFH and 28 torr for whole blood. While this is an improvement over SFH, the pyridoxylated Hgb remains "high affinity" relative to whole blood.

C. Chemical Crosslinking of Hemoglobin Subunits

The properties of hemoglobin have been altered by specifically chemically crosslinking the alpha chains between the Lys99 of alpha₁ and the Lys99 of alpha2. Walder, U.S. Pat. Nos. 4,600,531 and 4,598,064; Snyder, et al., PNAS (USA) 84: 7280-84 (1987); Chaterjee, et al., J. Biol. Chem., 261: 9927-37 (1986). The P₅₀ was 29 mm Hg, and renal excretion was abrogated by the crosslinking, but the plasma half-life was increased just 2-3 fold.

This chemical crosslinking was accomplished by reacting bis(3,5-dibromosalicyl) fumarate with deoxyhemoglobin A in the presence of inositol hexaphosphate. This reaction has a low yield (10-20%). Moreover, purification is required to eliminate derivatives modified at other sites (there are 42 other lysine residues and the amino terminal amino groups of the four chains at which competing reactions could occur).

A further problem with the use of a "diaspirin " crosslinking agent is that it can participate in a side reaction yielding a carcinogenic halophenol.

In the hemoglobin analogue of the present invention, the N-terminal valine and c-terminal arginine of the alpha globins are connected by means of an amino acid or peptide linker, without resort of special coupling agents.

The beta chains have also been chemically crosslinked. Kavanaugh, et al., Biochemistry, 27: 1804-8(1988). Kavanaugh notes that the beta N-termini are 16 Å apart in the T state and 20 Å apart in the R state. Not surprisingly, the introduction of a DIDS bridge between the N-termini of T state hemoglobin hindered the shift to the R state, thereby decreasing the O₂ affinity of the molecule. while the Kavanaugh analogue has desirable oxygen binding and renal clearance characteristics, it too is obtained in low yield.

D. Gene Expression, Generally

Gene expression embraces the transcription of DNA into messenger RNA, and the translation of messenger RNA into protein. The process of transcription begins when an enzyme, DNA-directed RNA polymerase, binds to DNA. The binding site for this enzyme is often called the "promoter," and the binding of the enzyme to the promoter may be controlled by various repressors or inducers of transcription. The RNA polymerase slides along the DNA molecule, manufacturing a messenger RNA transcript. When it encounters a second regulatory element, the "terminator," the enzyme falls off, and the mRNA transcript is formed.

Messenger RNA is used by the ribosomes, the protein factories of the cell, as a template for the construction of the corresponding protein. The ribosomal binding site comprises the so-called Shine Delgarno (SD) sequence and a properly spaced initiation (start) codon. Beginning at a special RNA triplet known as the initiation codon, transfer RNAs bind to corresponding codons of the messenger. Each transfer RNA is two-handed; it binds to the messenger codon by means of a complementary anti-codon, while holding the corresponding amino acid in position to be linked into the growing polypeptide chain. The chain falls off when the ribosome encounters one of three special triplets known as "stop" codons. That part of the original gene which corresponds to the messenger sequence from the initiator codon to the last codon before the stop codon is known as the coding sequence. There is also a 5'-flanking sequence, which begins with the promoter, and a 3'-flanking sequence which ends with the terminator.

E. Polycistronic Expression

It is possible for a single messenger RNA transcript to have one promoter, but two or more pairs of start and stop codons that define distinctly translatable sequences. Each such sequence is known as a "cistron," and the polypeptide corresponding to the cistrons are thus co-expressed under the control of the single promoter.

The majority of bacterial operons are polycistronic, that is, several different genes are transcribed as a single message from their operons. Examples include the lactose operon with three linked genes (lacZ, lacY and lacA) and the tryptophan operon with five associated genes (trpe, trpD, trpC, trpB, and trA). In these operons, the synthesis of messenger RNA is initiated at the promoter and, within the transcript, coding regions are separated by intercistronic regions of various lengths. (An operon is a cluster of genes that is controlled as a single transcriptional genetic unit). Translational efficiency varies from cistron to cistron. Kastelein, et al., Gene, 23: 245-54 (1983).

When intercistronic regions are longer than the span of the ribosome (about 35 bases), dissociation at the stop codon of one cistron is followed by independent initiation at the next cistron. With shorter intercistronic regions, or with overlapping cistrons, the 30S subunit of a terminating ribosome may fail to dissociate from the polycistronic mRNA, being instantly attracted to the next translational initiation site. Lewin, Gene Expression, 143-148.

Unlike bacterial mRNAs, eukaroyotic mRNAs are generally monocistronic in nature. Lewin, Gene Expression, 157.

Synthetic polycistronic operons have been constructed and expressed in both prokaryotes and eukaryotes.

Lee, et al., Nucleic Acids Res., 12: 6797 (1984) describe a special case of a synthetic polycistronic operon in which all of the cistrons express the same polypeptide. This homopolycistronic structure was constructed to maximize the gene dosage effect.

Schoner, et al., PNAS, 83: 8506-10 (1986) translated a synthetic two-cistron mRNA in E. coli. The first cistron was a short, arbitrary AU-rich sequence, while the second cistron was a mammalian gene (bGH). It was found that "read through" translation occurred if the stop codon of the first cistron followed the SD element of the second cistron and lay close to the start codon of the second cistron. Schoner's purpose was to overcome his failure to express Met-bGH with its native codons at high levels, possibly as a result of inhibition of translation by local secondary structures. The first cistron was engineered to favor ribosome binding (by placing the SD sequence and the AUG initiation codon in an AU-rich region free of local secondary structure). See also Schoner, et al., Meth. Enzymol., 153: 401-416 (1987) which reveals that bGH overproduction by this technique was associated with the formation of protein granules.

Saito, et al., J. Biochem., 101: 1281-88 (1987) expressed a synthetic somatomedin C gene in E. coli using a two cistron system. They theorized that the instability of somatomedin C, a basic polypeptide, might be overcome by complexing it with an acidic polypeptide. Thus, they constructed a two-cistron system which could express both polypeptides. The termination codon of the first cistron overlapped the initiation codon of the second cistron. The transformants accumulated Somatomedin C at high levels. However, the somatomedin C was recovered in the form of insoluble pellets (see page 1282).

The ribosomes of mammalian cells are likewise capable of reinitiating translation at an initiation codon downstream from a termination codon. Thus, Boel, et al., FEBS Lett., 219:181 (1987) expressed a dicistronic transcription unit in mammalian (CHO) cells. This unit directed synthesis of both the precursor of human pancreatic polypeptide and of a selectable genetic marker (mouse DHFR).

CODON, WO88/05486 describes the production of dicistronic mRNA which encodes both a protein of interest (e.g., tissue plasminogen activator) and a selectable phenotype (e.g., neomycin resistance). The common promoter was, in each of the examples a derivative of the Harvey murine sarcoma virus, and the dicistronic mRNA was translated in suitable eukaryotic cells.

GENENTECH, EP Appl 117,058 discloses the expression in vertebrate host cells of a dicistronic expression vector wherein one cistron codes for the desired protein (e.g., HbsAg) and a second codes for a second protein (e.g., DHFR) whose synthesis is subject to environmental control(e.g., with methotrexate).

F. Fused Genes and Proteins, Generally

Genes may be fused together by removing the stop codon of the first gene, and joining it in phase to the second gene. Parts of genes may also be fused, and spacer DNAs which maintain phase may be interposed between the fused sequences. The product of a fused gene is a single polypeptide, not a plurality of polypeptides as is expressed by a polycistronic operon. Different genes have been fused together for a variety of purposes. Thus, Gilbert, U.S. Pat. No. 4,338,397 inserted a rat preproinsulin gene behind a fragment of the E. coli penicillinase gene. His purpose was to direct E. coli transformants to secrete the expression product of the fused gene. Fused genes have also been prepared so that a non-antigenic polypeptide may be expressed already conjugated to an immunogenic carrier protein. The present invention, however, contemplates the joining of two copies of the same gene.

The use of linker DNA sequences to join two different DNA sequences is known. These linkers are used to provide restriction sites for DNA cleavage, or to encode peptides having a unique character that facilitates purification of the encoded fusion protein or a fragment thereof. See, e.g., Rutter, U.S. Pat. No. 4,769,326.

The lectin of Pisum sativum seeds is synthesized as a single 275-amino acid preproprotein consisting of a signal sequence followed first by the beta chain and then by the alpha chain. The signal sequence is removed in the endoplasmic reticulum, and in the protein bodies the resulting "prolectin" is cleaved into a 187-AA beta chain and a 58-AA alpha chain. (Further processing results in truncation at the carboxyl termini). While the pea seed isolate is thus a heterodimer, it was discovered that the uncleaved naturally-occurring "prolectin" also binds carbohydrates, and that this "prolectin" could be expressed in E. coli and recovered in functional form. Stubbs, et al., J. Biol. Chem., 261: 6141-44 (1986).

Toth, U.S. Pat. No. 4,774,180 teaches the expression of polyprotein. This polyprotein was made from a fused DNA sequence encoding both a first polypeptide which catalyzes the reaction of glycine with ATP to form glycyl-adenylate and a second polypeptide which reacts glycyl adenylate with tRNA^(GLY) to obtain the glycine-charged tRNA. These two polypeptides are the alpha and beta subunits of glycine tRNA synthetase which has an α₂ β₂ quaternary structure. The two subunits, in the E. coli genome, are encoded by a single dicistronic gene. Toth linked the coding region of the alpha chain to the coding region of the beta chain by means of a linker encoding six amino acids. See also Toth and Schimmel, J. Biol. Chem., 261: 6643-46 (May 1986).

Ladner, U.S. Pat. No. 4,704,692 describes an expert system for finding linkers which may be used to convert two naturally aggregated but chemically separated polypeptide chains into a single polypeptide chain with a similar conformation after folding. This system relies on a database containing amino acid sequences for which 3-D structures are known. The database is examined for candidate amino acid sequences with a span similar in length to the interchain gap to be bridged. The direction and orientation of the candidate peptides in then checked. The algorithm assumes that these peptides will maintain the same length and orientation regardless of the flanking sequences.

Ladner, WO88/06601 presents a hypothetical approach to the preparation of "pseudodimeric" analogues of dimeric repressor proteins. In essence, an amino acid linker is introduced to convert the dimeric molecule into a single chain. According to Ladner, this linker may be designed directly by the method of the '692 patent; alternatively, the linker-encoding DNA is a random oligonucleotide and in vivo selection is used to find a pseudodimer whose linker permits the molecule to fold correctly and bind sequence-specifically to DNA.

Hallewell, et al., J. Biol. Chem., 264: 5260-68 (1989) prepared an analogue of CuZn superoxide dismutase. Each dismutase molecule is a dimer of two identical subunits; a copper ion and a zinc ion are liganded to the subunit. The dimer interaction in CuZn superoxide dismutase is so strong that the subunits have not been separated without inactivating the enzyme. The enzyme has considerable conformational similarity to immunoglobulins; Hallewell, et al., joined two human superoxide dismutase genes, either directly or with DNA encoding a 19-residue human immunologlobulin IgA1 hinge region and expressed the fused genes in a transformed host. In attempting to express the directly joined genes, recombination occurred to eliminate one of the tandem genes in some plasmid molecules. Hallewell, et al., postulated that the direct connection distorted the dimer, causing the exposure of hydrophobic areas which then had a toxic effect. This would have provided selection pressure favoring gene deletion. No recombination was detected with the IgA1 linker construction.

Unfortunately, it cannot be assumed that a pseudodimeric fusion protein containing a peptide linker will fold properly so to be a functional equivalent of its parental heterodimer.

G. Expression of Soluble Proteins

Efforts to produce heterologous proteins in transformed cells sometimes result in the precipitation of some or all of the protein as insoluble inclusion bodies, also known as refractile bodies. See, e.g., Paul, et al., Eur. J. Cell Biol., 31:171-174 (1983) (human proinsulin/E. coli trpE fusion protein); Denefle, et al., Gene, 56:61-70 (1987) (angogenin); Langley, et al., Eur. J. Biochem., 163:313-321 (1987) (bovine growth hormone); Petrov, et al., Biology of the Cell, 61:1-4 (1987) (calcitonin); Richardson, et al., Biochim. Biophys. Acta, 950:385-94 (1988) (ricin B chain); Davis, et al., Biochemistry, 26:1322-26 (1987) (tumor necrosis factor); Lee, et al., Biochim. Biophys. Res. Commun., 151:598-607 (1988) (gamma interferon); Meng, et al., J. Chromatogr., 443:183-92 (1988) (Somatomedin C); Tsuji, et al., Biochemistry, 26:3129-34 (1987) (interleukin-2). The term "refractile" refers to the ability to observe these bodies by phase contrast microscopy. Frequently, this insoluble protein retains only a fraction of the expected biological activity, possibly due to incorrect folding. It has been suggested that inclusion bodies are formed when molecules of partially folded proteins interact with each other faster than they can fold into their native, active conformation. Kruger, et al., Biopharm, 40 (March 1989); Haase-Pettingell and King, J. Biol. Chem., 263:4977-83 (1988). "Factors contributing to the formation of inclusion bodies in recombinant bacteria remain obscure and it is not easy to predict the physical state of the product of a newly ex pressed eukaryotic gene in E. coli." Petrov, supra.

While the formation of these inclusion bodies results in enrichment of the recombinant protein, and is therefore sometimes desirable, it also necessitates solubilization of the aggregates and regeneration of the protein's biological activity. Petrov, supra at 4, comments, "sometimes these obstacles seem to be the most critical point of the recombinant technology."

Attempts have been made to solubilize and renature these proteins. Wetzel, U.S. Pat. No. 4,599,197; Builder, U.S. Pat. No. 4,620,948; Olson, U.S. Pat. No. 4,511,503; Jones, U.S. Pat. No. 4,512,922. However, such efforts can be laborious and uncertain of success. See Giantini and Shatkin, Gene, 56:153-160 (1987). As stated by Weir and Sparks, Biochem. J., 245: 85-91 (1987), "proteins vary considerably in their optimal conditions for renaturation; various factors such as pH, salt concentration and type, rate of removal of denaturant, concentration of the target protein and of contaminants may strongly affect the recovery of authentic protein." These complications are avoided if the protein of interest is expressed in soluble form.

Gatenby, et al., Eur. J. Biochem., 168: 227-31 (1987) has discussed difficulties in the preparation of the higher plant enzyme ribulose-bisphosphate carboxylase. This enzyme has the subunit structure L₈ S₈, where L is a large subunit and S is a small subunit. In nature, a binding protein apparently maintains L in soluble form prior to assembly with S. Attempts to assemble an active higher plant RuBPCase in E. coli have been frustrated by the formation of an insoluble, inactive aggregate of L.

H. Bacterial Expression of Human Alpha and Beta Globins

Nagai and Thorgerson (Nature, 309: 810-812, 1984) expressed in E. coli a hybrid protein consisting of the 31 amino-terminal residues of the lambda cII protein, an Ile-Glu-Gly-Arg linker, and the complete human beta globin chain. They cleaved the hybrid immediately after the linker with blood coagulation factor Xa, thus liberating the beta-globin chain. Later, Nagai, et al., P.N.A.S. (U.S.A.), 82:7252-55 (1985) took the recombinant DNA-derived human beta globin, naturally derived human alpha globin, and a source of heme and succeeded in producing active human hemoglobin. Because the alpha globin was derived from erythrocytes, the final product may contain undesirable erythrocyte membrane constituents.

More recently, an efficient bacterial expression system for human alpha globin was reported. GB 8711614, filed May 16, 1987; see copending Ser. No. 07/194,338 and WO 88/09179. This led to the production of wholly synthetic human hemoglobin by separate expression of the insoluble globin subunits in separate bacterial cell lines, and in situ refolding of the chains in the presence of oxidized heme cofactor to obtain tetrameric hemoglobin. This procedure is laborious and low in yield. It requires the use of denaturing solvents (urea and guanidine), and chemical reduction of ferric ion to the ferrous state (see example). One object of the present invention is to overcome these disadvantages.

While human alpha and beta globins may be expressed separately in E. coli, Walder, Proceedings, Biotech USA 1988 (San Francisco, Nov. 14-16, 1988) warns at page 360, "isolated alpha and beta globin! chains are unstable and tend to precipitate." If human alpha and beta globin are not produced in soluble form, they must be solubilized with denaturing agents and then refolded to restore activity. Moreover, when a wild-type alpha globin gene is expressed in E. coli, alpha globin accumulates only slowly. It is not certain whether this is due to inefficient translation or to the action of host proteases, but WO 88/09179 teaches that this problem may be overcome by fusing a short section of the beta globin gene to the alpha globin gene, so that a hybrid protein is produced. This hybrid protein must then be cleaved, e.g., with a protease, to release the globin. If the protease is not completely selective (perhaps because of contamination by other proteases), the desired cleavage product may not be the only one obtained. In any event, that product must be separated from other E. coli polypeptides, and any contaminants associated with the protease.

Sperm whale myoglobin has been expressed in E. coli, demonstrating that bacteria can incorporate prosthetic heme groups into a protein expressed from a cloned eukaryotic gene. Springer and Sligar, PNAS (USA) 84: 8961-65 (1987). Walder says, "it remains to be seen if hemoglobin can be similarly made if both the alpha and beta chains are expressed within the same cell." While there is a high degree of tertiary structure similarity between myoglobin (a single chain protein) and the individual alpha and beta globin subunits of hemoglobin, hemoglobin is a heterotetrameric protein, in which the primary globin sequences have no more than a 27% homology to myoglobin and moreover myoglobin is now known to enjoy significantly higher stability than either alpha or beta globin. Thus, it could not be predicted that co-expression of alpha- and beta-globin in the same cell would result in intracellular assembly of a functional hemoglobin, which requires proper folding of the alpha and beta chains and incorporation of heme.

I. Human Gene Expression in Yeast, Generally

A number of human proteins have been expressed in transformed yeast cells, especially Saccharomyces cerevisiae, either cytoplasmically or by secretion into the culture medium. King, et al., Biochem. Soc. Transac., 16:1083-1086 (1988). But success is not guaranteed. Thim, et al., FEBS Lett., 212:307-312 (1987) experienced difficulty in obtaining properly crosslinked insulin from yeast cells in which the intact proinsulin-encoding gene had been inserted. They overcame this problem by constructing a modified proinsulin gene which encoded the B and A chains linked by a hexapeptide spacer. The product of this gene was cleaved and the two chains were properly folded and crosslinked by the cells.

Richardson, et al., Biochim. Biophys. Acta, 950:385-94 (1988) expressed the B chain of the heterodimeric protein ricin in E. coli. They reported that it was hard to obtain high levels of secretion of a yeast alpha factor leader/ricin B chain fusion protein. No attempt was made to co-express and assemble the ricin A and B chains.

Murakami, et al., DNA, 6:189-97 (1987) reported production of a heme-containing fused enzyme in transformed yeast cells.

Horwitz, et al., PNAS (USA), 85:8678-82 (Nov. 1988) described the construction of yeast strains which secrete functional mouse variable region/human IgG1 constant region chimeric antibodies into the culture medium. They characterize their paper the first report of the secretion of a foreign multimeric or heterodimeric protein in yeast. But see also Carlson, Mol. Cell. Biol., 8:2638-46 (June 1988), showing transcription and translation of heavy and light-chain cDNAs into polypeptides which associate and bind antigen.

Beggs, et al., Nature, 283:835 (1980) attempted to express a chromosomal rabbit beta globin gene in S. cerevisiae. However, these yeast cells were unable to properly splice the intron-containing globin mRNA transcript.

No admission is made that any reference cited herein is prior art. The description of the work and the citation of publication date are based solely on the published information and the applicants reserve the right to question the accuracy of that information.

SUMMARY OF THE INVENTION

It is the object of this invention to overcome the aforementioned deficiencies of the prior art. For example, Applicants have achieved the first complete expression and assembly of tetrameric (including pseudotetrameric) hemoglobin in cells which do not produce hemoglobin in nature. Prior work has related to the separate expression of alpha and beta globin and their extracellular combination with heme to form hemoglobin.

A central feature of the present invention is the intracellular assembly of alpha- and beta-globin-like polypeptides and intracellular incorporation of heme to form a biologically functional hemoglobin-like protein. This intracellular assembly is achieved by expressing the alpha-and beta-globin-like polypeptides in the same cell so that they fold together and incorporate heme. An important characteristic of this invention is a substantial reduction in the formation of insoluble globin aggregates, in particular of beta globin, as compared to what is observed when globins are separately expressed in E. coli or S. cerevisiae. Co-expression may be achieved from genes on two separate but compatible plasmids in the same cell, or from two different operons on the same plasmid, or from a single polycistronic operon.

In one embodiment, the alpha- and beta-globin-like polypeptides are co-expressed in bacterial cells. The corresponding genes may be included in the same synthetic operon (i.e., driven by one promoter), or placed in separate operons with separate promoters (which may be the same or different). Preferably, expression of the alpha- and beta-globin is enhanced by placing a "ribosomal loader" sequence as hereafter described before each globin gene. This is particularly advantageous in the case of alpha globin which is more difficult to produce in quantity.

In another embodiment, the alpha- and beta-globin-like polypeptides are co-expressed in yeast cells. Improvements in both the yield of the alpha globin and the solubility of beta globin are obtained.

A further aspect of the invention is the production of novel intermediates, di-alpha globin and di-beta globin (and mutants thereof), which can be expressed in a cell (including, but not limited to, bacterial and yeast cells) and assembled with each other or with beta- or alpha-globin-like polypeptides, respectively, into a pseudotetrameric hemoglobin-like protein. While intracellular assembly is not strictly required, di-alpha and di-beta globin may be considered specially adapted to intracellular assembly of a functional hemoglobin since expression of, e.g., a di-alpha globin is analogous in some respects to intracellular assembly of two alpha globin subunits, differing from assembly as previously discussed in that the association is accomplished by expression of a covalent peptide linker rather than by noncovalent interaction of the subunits. Di-alpha and Di-beta-globin-like polypeptides may be expressed in, preferably, bacterial cells or in yeast cells.

Moreover, the expression of di-alpha or di-beta genetically stabilized hemoglobin-like proteins and the utilization of such as a blood substitute, prolongs the half-life of recombinant hemoglobin by reducing extravasation and glomerular filtration of dissociated subunits in vivo compared to native human hemoglobin. Our studies of hemoglobin is excreted at levels similar to control levels, while a similar recombinant hemoglobin excretion in rat urine have demonstrated that genetically stabilized recombinant human hemoglobin that was not so genetically stabilized, undergoes significant dissociation into dimers and is excreted at substantially higher levels. Furthermore, genetic stabilization of hemoglobin results in a two fold or greater increase in half-life of hemoglobin in the plasma of rats.

The invention further relates to production of octameric hemoglobins, and of certain higher multimers, by linkage of pseudotetramers in various configurations.

These facets of the invention are now discussed in greater detail.

Yeast Expression of Hemoglobin-Like Proteins

Applicant have discovered that it is possible to produce human hemoglobin (or mutants thereof) in yeast, especially Saccharomyces cerevisiae. The use of a yeast expression system obviates the need to separate the hemoglobin from bacterial endotoxins. We have also found that alpha and beta globins with the correct N-terminal amino acid may be obtained without first expressing the globin as a part of selectively cleavable fusion protein. We believe that this is because the yeast enzyme methionyl aminopeptidase is capable of removing the N-terminal methionine from Met-alpha-globin and Met-beta-globin to expose the desired N-terminal amino acid (Valine). Production of altered oxygen affinity mutants as discussed in WO88/09179 is of special interest. Such mutants may be produced by site-specific mutagenesis of globin genes followed by cloning and expression in yeast.

In a preferred embodiment, expression is controlled by a "gal-gap49" hybrid promoter as hereafter defined.

Co-Expression of Alpha and Beta Globin Genes in Yeast Cells

In a preferred embodiment, the alpha and beta globin genes are both expressed within the same yeast cell. Expression of the beta globin gene alone results in the production of beta globin as a largely insoluble, difficult-to-extract protein comprising less than 2% of the total cell protein. Expression of the alpha globin gene alone results in production of alpha globin at very low levels (under 0.5% of the total cell protein). In neither case is heme incorporated. When, however, the alpha and beta globin genes are co-expressed, the transformed yeast cells fold the alpha and beta globin chains together and incorporate heme groups to obtain functional recombinant human hemoglobin in soluble form, accumulating to about 10% of the total cell protein, without any change in the promoters operably linked to the genes.

The alpha and beta globin genes may, in turn, be carried on different plasmids or on the same plasmid within the host cell.

Polycistronic Co-Expression of the Alpha and Beta Globin Genes in Bacterial Cells.

Applicants have translationally coupled alpha and beta globin genes to a small "ribosomal loader" gene encoding a small consumable peptide that will lead the ribosome directly into the ATG of the desired alpha and beta globin message and thus enhance translational efficiency. The have also placed the alpha and beta globin genes in the same operon so they are transcribed into a single polycistronic mRNA transcript. The globins are then translated as separate polypeptide chains which subsequently are folded together and joined with intracellular heme by transformed cells to form the hemoglobin tetramer. Applicant's method overcomes the problem associated with separate purification of precipitated alpha and beta globins.

The polycistronic expression and assembly of a heterooligomeric human protein in soluble, active form in a heterologous host has not been previously reported. It is especially noteworthy that this was a mammalian protein expressed in a prokaryotic (bacterial) host. It should further be considered that this protein incorporates prosthetic groups, which add a further complication to the goal of proper post-translational processing.

In one embodiment, Met-FX-alpha globin and Met-FX-beta globin are co-expressed, where FX denotes a leader peptide which a recognition site for Factor Xa cleavage activity. FX-alpha globin and FX-beta globin assemble to form a mutant hemoglobin with reversible oxygen binding activity, albeit higher in affinity for oxygen than native hemoglobin. Alternatively, the FX leader, or other fused leader, may be cleaved to obtain a duplicate of native Hgb.

In another embodiment, Met-alpha globin and Met-beta globin are co-expressed. This eliminates the need for a cleavage step.

In a third embodiment, des-val-alpha globin and des-val beta globin are co-expressed. Native alpha and beta globin both begin with valine. The valine may, however, be replaced with methionine, which is of similar hydrophobicity.

In further embodiments, one or more codons of the native genes are altered so that a alpha and/or beta globin-related protein characterized by one or more amino acid differences (insertions, deletions or substitutions) from the native species is produced.

Globin Pseudodimers (Especially Di-Alpha and Di-Beta Globins) and Genetically Fused Hemoglobin Pseudotetramers, Etc.

A new protein, di-alpha globin, has been prepared, which consists of two alpha globin amino acid sequences covalently connected by peptide bonds, preferably through an intermediate linker of one or more amino acids. Surprisingly, these "genetically fused" alpha globin chains were capable of appropriately folding and combining with beta globin and heme to produce functional hemoglobin analogue. The term "genetically fused" refers to the method of production. Two copies of the globin gene are fused together, preferably with a spacer DNA encoding the amino acid linker, so the construct directly encodes the desired di-alpha globin. The term "analogue" is used because in native hemoglobin, the alpha₁ and alpha2 subunits are noncovalently bound. The analogous preparation of di-beta globin has also been accomplished. Methods for preparation of an analogous α₁ β₂ (or β₂ α₁) globin pseudodimer have been proposed.

The preparation of "genetically fused" hemoglobins avoids the disadvantages of chemical crosslinking. The latter is inefficient and often requires deoxygenation of the hemoglobin solution and the presence of another molecule (e.g., inositol hexaphosphate or 2,3-DPG) to prevent competing reactions.

In a preferred embodiment, the di-alpha globin and/or the beta globin contain mutations which reduce the oxygen-binding affinity of the hemoglobin analogue in solution so as to approach the oxygen-binding characteristics of whole blood.

The di-alpha hemoglobin advantageously exhibits a substantially longer half-life in the circulatory system than does conventional (des-val) recombinant hemoglobin. Preferably, in humans, the half-life exceeds 9 hours at a dose of at least 1 gm/kgm body weight. This would be expected to correspond to a half-life of about 3 hours in rats given a comparable dose.

Since the fusion prevents dissociation of the hemoglobin into αβ dimers, kidney function is protected.

The di-alpha, di-beta and alphabets globins can be expressed in cells conventionally used for expression of recombinant proteins especially bacteria and yeast.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1: Flowchart for construction of plasmids for expression of FX-alpha globin (pDL II-62m), FX-beta globin (pDL II-10a), and FX-hemoglobin (pDL II-66a) are schematized.

FIGS. 2a and 2b: Flowcharts for construction of plasmid pDL III-1a (2a-1) bearing dicistronic Des-Val-Alpha globin gene under control of Tac promoter, and polycistronic di-alpha/beta co-expression plasmid pDL III-47a (2b-1).

FIG. 3: Flowchart for construction of plasmid for co-expression of Met-alpha and Met-beta globins, pDL III-13e (3a and 3b).

FIG. 4: oligonucleotides for construction of synthetic FX-alpha and FX-beta globin genes (4a to 4c). The top strand is shown 5' to 3' and the bottom strand as 3' to 5'. Areas of overlap between complementary synthetic oligonucleotides are shown as areas where both strands are shown in the same case letters. The PstI site that joins FX-alpha and FX-beta occurs at the overlap of SJH I-35a and SJH I-36b.

FIG. 5: Synthetic gene for expression of Met-FX-alpha and Met-FX-beta globin (5a to 5c). Region A contains the alpha globin gene and region B the beta globin gene. The location of the Factor X sequence and the two Shine-Delgarno sequences (SD#1 and SD#2) in both regions is indicated. Selected restriction sites are also found. The translated amino acid sequences for the ribosomal loader and Met-FX-alpha/and beta-globin are given.

FIG. 6: Elution profile and absorbance spectrum for FX-hemoglobin.

FIG. 7: Oligonucleotides for construction of mutant hemoglobins (differing in amino acid sequence from conventional hemoglobin).

FIG. 8: Oligonucleotides for construction of plasmids which do not encode the Factor Xa substrate recognition site.

FIG. 9: Map of Plasmid pDL III-13e.

FIG. 10: Oxygen Binding of Des-Fx Hgb

FIGS. 11a and 11b: Plasmids pDL III-14c (11a) and pDL III-38b (11b).

FIG. 12 Shows the sequence of a preferred synthetic gene for expression of (des-Val)-alpha-(GlyGly)-alpha and des-Val beta globin. A shows the region (EcoRI to PstI) containing Shine-Delgarno ribosomal binding sites (SD#1 and SD#2), the sequence expressing the octapeptide (Met . . . Glu) which serves as a cotranslational coupler, and the sequence encoding the two nearly identical alpha globin-like polypeptides and the interposed Gly-Gly linker. The first alpha globin sequence begins "Met-Leu", that is, it contains an artifactual methionine, omits the valine which is the normal first residue of mature alpha globin, and continues with the second residue, leucine. The second alpha globin sequence begins "Val-Leu", immediately after the underlined "Gly-Gly" linker. Start and stop codons are underlined. B shows the analogous region (PstI to HindIII) containing the coding sequence for des-Val beta globin. A and B are connected at the PstI site to form a single polycistronic operon.

FIG. 13: Shows the structure of the final expression vector pDL III-47a. "PTac" is the Tac promoter, and "ampicillin" is the ampicillin resistance gene. FIG. 13a shows an XbaI-BamHI fragment of PDL III-47a.

FIG. 14: Plasmid pSGE0.0E4

FIG. 15: Plasmid pSGE1.1E4

FIG. 16: Plasmid pDL IV-67a

FIG. 17: Plasmid pJR VI-54a

FIG. 18: Plasmid pDL di-alpha/beta/beta

FIG. 19: Flowchart showing the construction of various expression vectors featuring lambda P_(L) regulation of various polycistronic globin operons (19a to 19c).

FIG. 20: Nucleotide Sequence of GAL-GAP promoter, with restriction sites indicated (20a and 20b). The region from SphI to EcoRV contains a synthetic GAL₁₋₁₀ regulatory region (M. Johnston and R. Davis. 1984. Molecular and Cellular Biology, 4:1440-1448). The UAS is in the region numbered 29-63 on this Figure. The region from EcoRV to the XbaI site contains the consensus GAP491 transcriptional start region, with the approximate start of transcription being at 395. (L. McAlister and M. J. Holland. 1983. J.

Biol. Chem., 260:15019-15027; J. Holland, et al. 1983. J. Biol. Chem., 258:5291-5299.)

FIGS. 21a and 21b: Flowcharts showing construction of beta-globin expression cassette (21a-1 TO 21a4 and 21b-1 to 21b-3).

FIGS. 22a 21b: Flowcharts showing construction of beta-globin expression vector pGS4988 (22a-1 and 22a-2) and alpha-globin expression vectors pGS4688 and pGS 4488 (22b).

FIGS. 23a and 23b: Flowcharts (23a and 23a-1 and 23b to 23b-3) showing construction of an alpha-globin expression cassette and of vectors pGS289 and pGS389 for co-expression of alpha- and beta globin from the same plasmid.

Note that alpha and beta globin are expressed from separate promoters.

FIG. 24: Absorption spectra for yeast-produced recombinant and native human hemoglobin.

FIGS. 25a and 25b: Flowchart (25a-1 to 25a-3) showing construction of di-alpha/beta hemoglobin yeast expression vector and map of plasmid pGS3089 (25b).

FIG. 26: Map of plasmid pGS3089RGV desβ.

FIG. 27: is a stylized representation of one form of pseudooctameric Hgb, in which the octameric hemoglobin is formed by linking or crosslinking two molecules of an asymmetric di-alpha Hgb.

FIG. 28: depicts coiled coil crosslinkers suitable for joining (28a) four or (28b) six Hgb tetramers. FIG. 28(c) is a top view of a 4-helical bundle, with attachment sites marked.

FIG. 29: maps plasmid pSGE1.1.1E4.

FIG. 30: is a flow chart showing the construction of a di-beta Hgb expression vector, pSGE1.05E4.

FIG. 31: maps plasmid pSGE1.05E4.

FIG. 32: The symmetry-averaged difference map rHb1.1-HbA superimposed on the atomic model. The map is contoured at plus (solid contours) and minus (broken contours) about 2 times the root-mean-square value of the electron density prior to symmetry averaging about the molecular dyad. Filled bonds represent the refined coordinates of the deoxy HbA, Fermi, G. Perutz, M., Shaanan, B., Fourme, R., J. Mol. Biol. 175, 159-174 (1984). Open bonds and single broken bonds represent approximate coordinates of the two symmetry-related configurations of the Hb1.1 diα-chain termini: α1-Gly-α2 (open bonds) and α2-Gly-a1 (broken bonds). X-ray data to 2.5 Å resolutions were collected from a single crystal on a FAST/MADNES area detector with a rotating anode source. Absorption, Lorentz and polarization corrections, and batch scale and temperature factors (batches of 5 degrees rotation in angle φ), were applied to the raw X-ray data, which were then merged to yield 18231 unique reflections, or 96% of the total between 10 and 2.5 Å resolution. Friedel pairs were measured for 76% of unique reflections; the R-factor on intensity between Friedel pairs in the reduced data set was 6.6% on intensity. The reduced data were converted to amplitudes and scaled to native deoxy HbA data with an R-Factor of 16%. The difference map was calculated with phases obtained from the atomic model of deoxy Hb.

FIG. 33 In vivo half-life of rHb1.1 and rHb1.0. The fraction of ³ H-rHb1.0 or rHb1.1 remaining in the intravascular compartment is shown in the y axis. The value for 100% was determined from samples taken 5 min. after administration of the hemoglobin samples. Radioactive hemoglobin was prepared by adding 5 C mCi or ³ H-leucine (Amersham, 1100-1500 Ci/mMole) to 2 L fermenters of E. coli JM109 pSGE1.1-E4! or JM109 pSGE1.0-E4! 15 min prior to induction with IPTG. Hemoglobin was purified essentially as described, Hoffman, S. J., Looker, D. L., Roehrich, J. M., Cozart, P., Durfee, S., Stetler, G., Proc. Nat. Acad. Sci. (USA) 87, 8521-25 (1990), and combined with unlabeled rHb1.1 or rHb1.0 to adjust the specific activity to 8.3×10³ dpm/mg (rHb1.1) and 9.5×10³ dpm/mg (rHb1.0). Eight Sprague-Dawley rats (200-300 gm) in two groups of four were administered either 1 ml of ³ H-rHb1.1 (156 mg/ml) through a venous catheter. Blood samples (0.2 ml) were removed at 5 min., 30 min., 1,2,3,4,5,6,7, and 8 hours from an arterial catheter. The animals were anesthetized with a mixture of ketamine hydrochloride and xylazine by intramuscular injection prior to insertion of catheters and allowed to recover for three days prior to administration of hemoglobin solutions.

FIG. 34 Map of plasmid pSGE1.0E4

FIGS. 35a and 35b Schematics showing how cysteine mutations can favor formation of octamer without genetic fusion of subunits.

FIG. 36 Proposed alpha₁ -beta₂ globin pseudodimer

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Hemoglobin Structure

The structure of conventional hemoglobin is well known. We herewith incorporate by reference the entire text of Bunn and Forget, eds., Hemoglobin: Molecular, Genetic and Clinical Aspects (W. B. Saunders Co., Philadelphia, Pa.: 1986) and of Fermi and Perutz "Hemoglobin and Myoglobin," in Phillips and Richards, Atlas of Molecular Structures in Biology (Clarendon Press: 1981).

The primary structure of a polypeptide is defined by its amino acid sequence and by identification of any modifications of the side chains of the individual amino acids.

About 92% of the normal adult human hemolysate is Hgb A (designated alpha2 beta2, because it comprises two alpha and two beta chains). The alpha chain consists of 141 amino acids (See FIG. 12). The iron atom of the heme (ferroprotoporphyrin IX) group is bound covalently to the imidazole of his 87 (the "proximal histidine"). The beta chain is 146 residues long (see FIG. 12) and heme is bound to it at his 92.

Other recognized hemoglobin species are Hgb A₂ (α₂ α₂ δ₂), Hgb A₁ a Hgb A₁ b and Hgb A₁ c, as well as the rare species Hgb F (a₂ gamma₂), Hgb Gower-1 (Zeta₂ epsilon₂), Hgb Gower-2 (alpha₂ epsilon₂), Hgb Portland (Zeta₂ gamma₂), and Hgb H (beta₄) and Hgb Bart (gamma₄). They are distinguished from Hgb A by a different selection of polypeptide chains.

Segments of polypeptide chains may be stabilized by folding into one of two common conformations, the alpha helix and the beta pleated sheet. In its native state, about 75% of the hemoglobin molecule is alpha-helical. Alpha-helical segments are separated by segments wherein the chain is less constrained. It is conventional to identify the alpha-helical segments of each chain by letters, e.g., the proximal histidine of the alpha chain is F8 (residue 8 of helix F). The non-helical segments are identified by letter pairs, indicating which helical segments they connect. Thus, nonhelical segment BC lies between helix B and helix C. In comparing two variants of a particular hemoglobin chain, it may be enlightening to attempt to align the helical segments when seeking to find structural homologies. For the amino acid sequence and helical residue notation for conventional human hemoglobin A₀ alpha and beta chains, see Bunn and Forget, supra, and Table 1 herein.

The tertiary structure of the hemoglobin molecule refers to the steric relationships of amino acid residues that are far apart in the linear sequence, while quaternary structure refers to the way in which the subunits (chains) are packed together. The tertiary and quaternary structure of the hemoglobin molecule have been discerned by X-ray diffraction analysis of hemoglobin crystals, which allows one to calculate the three-dimensional positions of the very atoms of the molecule.

In its unoxygenated ("deoxy", or "T" for "tense") form, the subunits of hemoglobin (alpha1, alpha2, beta1, and beta2) form a tetrahedron having a twofold axis of symmetry. The axis runs down a water-filled "central cavity". The subunits interact with one another by means of Van der Waals forces, hydrogen bonds and by ionic interactions (or "salt bridges"). The alpha1beta1 and alpha2beta2 interfaces remain relatively fixed during oxygenation. In contrast, there is considerable flux at the alpha1beta2 (and alpha2beta1) interface. In its oxygenated ("oxy", or "R" for "relaxed" form), the intersubunit distances are increased.

The tertiary and quaternary structures of native oxyhemoglobin and deoxyhemoglobin are sufficiently well known that almost all of the nonhydrogen atoms can be positioned with an accuracy of 0.5 Å or better. For human deoxyhemoglobin, see Fermi, et al., J. Mol. Biol., 175: 159 (1984), and for human oxyhemoglobin, see Shaanan, J. Mol. Biol., 171: 31 (1983), both incorporated by reference.

While analyses of hemoglobin structure tend to focus on the alpha-beta interfaces, it is known that the distance between the amino terminus of one alpha subunit and the carboxyl terminus of the other is about 5.6 Å in the deoxy configuration and 3.3 Å in the oxy configuration.

Definitions

For the purpose of the appended claims, a hemoglobin-like protein is an oxygen binding protein with a plurality of heme prosthetic groups and comprising one or more heterotetramers composed of (a) two alpha globin-like and two beta globin-like polypeptides, (b) one di-alpha globin-like and two beta globin-like polypeptides, (c) two alpha globin-like and one di-beta globin-like polypeptides, (d) one di-alpha globin-like and one di-beta globin-like polypeptides, (e) one fused alpha/beta globin-like polypeptide and separate alpha and beta globin-like polypeptides, or (f) two fused alpha/beta globin-like polypeptides. A polypeptide of one tetramer may be crosslinked or genetically fused to a polypeptide of another tetramer. A hemoglobin is said to be multimeric if it comprises more than four globin subunits or domains. The term "multimeric" thereby includes octameric hemoglobin (2 tetramers), as well as higher multimers.

A human alpha globin-like domain or polypeptide is native human alpha globin or a mutant thereof differing from the native sequence by one or more substitutions, deletions or insertions, while remaining substantially homologous (as hereafter defined) with human alpha globin, and still capable of incorporating heme and associating with beta globin. A beta globin-like domain or polypeptide is analogously defined. Subunits of animal hemoglobins or mutants thereof which are sufficiently homologous with human alpha or beta globin are embraced by the term "human alpha or beta globin-like domain or polypeptide." For example, the subunits of bovine hemoglobin are within the scope of these terms. The alpha- and beta-globin-like polypeptides may be referred to collectively as "globins". For the sake of convenience the term "polypeptide" may refer to a unitary chain or to a domain of a longer polypeptide chain.

A "genetically fused hemoglobin" is a hemoglobin-like protein comprising at least one "genetically fused globin-like polypeptide" (globin pseudooligomer), the latter comprising two or more globin-like domains which may be the same or different. A di-alpha globin-like polypeptide is one which consists essentially of two alpha-globin-like polypeptide sequences (domains) connected by peptide bonds between the normal C-terminus of the first alpha-globin-like polypeptide (domain) and the normal N-terminus of the second alpha-globin-like polypeptide (domain). These two sequences may be directly connected, or connected through a peptide linker of one or more amino acids; the term "peptide bonds" is intended to embrace both possibilities. Apha globin chains crosslinked at the N-and C-terminals other than by peptide bonds (e.g., by DIDS) are not di-alpha globins. The di-alpha globin-like polypeptide must be capable of folding together with beta globin and incorporating heme to form functional hemoglobin-like protein. The di-beta globin-like polypeptide is analogously defined. A di-alpha or di-beta globin-like polypeptide with a mutation in only one of the component domains is called "asymmetric".

It is also possible to provide an "alpha/beta-globin-like pseudodimer" in which an alpha globin-like domain is connected by peptide bonds to a beta globin-like domain sequence. This "alpha/beta globin-like polypeptide", and the di-alpha and di-beta globin-like polypeptides, may collectively be referred to as "pseudodimeric globin-like polypeptides" or as "diglobins". By extension, a hemoglobin-like protein comprising a di-alpha, a di-beta, or a alpha/beta globin-like polypeptide is a "pseudotetramer".

In determining whether a polypeptide is substantially homologous to alpha (or beta) globin, sequence similarity is an important but not exclusive criterion. Sequence similarity may be determined by conventional algorithms, which typically allow introduction of a small number of gaps in order to achieve the best fit. Preferably, the alpha-globin-like polypeptides or domains) of the present invention have at least about 75% sequence identity with wild-type human alpha globin. However, a polypeptide of lesser sequence identity may still be considered "substantially homologous" with alpha globin if it has a greater sequence identity than would be expected from chance and also has the characteristic higher structure of alpha globin and similar biological activity. (Note that, as elsewhere explained, an alteration in oxygen affinity (P50), intravascular retention, or cooperativity may be desired, and does not render the mutant nonhomologous if it can still contribute to reversible oxygen-binding activity.) By way of comparison, Artemia's heme-binding domains are considered homologous with myoglobin even though the primary sequence similarity is no more than 27%, as alignment of the heme-binding domains around their conserved residues and the residues conserved in other hemoglobins (i.e., involved in heme contacts or in determining the relationship of the helical segments to each other) suggested that the Artemia domains possessed the classical globin helices A to H with their corresponding turns, as well as various conserved globin family residues. Also, among the serine protease inhibitors, there are families of proteins recognized to be homologous in which there are pairs of members with as little as 30% sequence homology.

Over a hundred mutants of human hemoglobin are known, affecting both the alpha and beta chains, and the effect of many of these mutations on oxygen-binding and other characteristics of hemoglobin are known. The human alpha and beta globins themselves differ at 84 positions. In addition, interspecies variations in globin sequence have been extensively studied. Dickerson, Hemoglobin: Structure, Function, Evolution and Pathology, ch. 3 (1983) reported that in 1982, the 60 known vertebrate alpha globins had identical residues at 23 of their 141 positions, while for the 66 vertebrate beta globins considered, 20 of the 146 amino acids are identical. The 60 vertebrate myoglobins, which also belong to the globin family, had 27 invariant amino acids out of 153 positions. If only mammals are considered, then the invariant amino acids are 50/141 for the alpha globins, 51/146 for the beta globins, and 71/153 for the myoglobins. Invariant positions cluster around the centers of activity of the molecule: the heme crevice and the intersubunit contacts. Of the variable amino acids, some diverge from the consensus sequence for only a small fraction of the species considered.

The number of total differences between human alpha globin and selected other vertebrate alpha globins is as follows: rhesus monkey (4), cow (17), platypus (39), chicken (35), human zeta (embryonic) (61), carp (71), and shark (88). For invertebrate globins the divergences are sea lamprey (113), mollusc (124), Glycera (marine bloodworm) (124) and Chironomus (midge) (131). Turning to the beta globin family, the differences of human beta globin from other vertebrate beta globins are rhesus monkey (8), human delta globin (10), cow beta globin (25), cow gamma globin (33), human gamma globin (39), human epsilon (embryonic) globin (36), platypus (34), chicken (45), shark (96), sea lamprey (123), mollusc (127), Glycera (125) and Chironomus (128) Many of these differences may be misleading--variable amino acids may exhibit only "conservative substitutions" of one amino acid for another, functionally equivalent one. A "conservative substitution" is a substitution which does not abolish the ability of a globin-like polypeptide (or domain) to incorporate heme and to associate with alpha and beta globin subunits to form a tetrameric (or pseudotetrameric) hemoglobin-like protein which, in keeping with the definition thereof, will reversibly bind oxygen. The following resources may be used to identify conservative substitutions (and deletions or insertions):

(a) data on functional hemoglobin mutants (over a hundred such mutants exist);

(b) data on sequence variations among vertebrate, especially mammalian, alpha globins and beta globins;

(c) data on sequence variations among vertebrate, especially mammalian, myoglobins;

(d) data on sequence variations between vertebrate and invertebrate globins, or among the invertebrate globins;

(e) data on the three-dimensional structures of human hemoglobin and other oxygen-binding proteins, and molecular modelling software for predicting the effect of sequence changes on such structures; and

(f) data on the frequencies of amino acid changes between members of families of homologous proteins (not limited to the globin family). See, e.g., Table 1-2 of Schulz and Schirmer, Principles of Protein Structure (Springer-Verlag: 1979) and FIG. 3-9 of Creighton, Proteins: Structure and Molecular Properties (W. H. Freeman: 1983).

While the data from (a)-(d) is most useful in determining tolerable mutations at the site of variation in the cognate proteins, it may also be helpful in identifying tolerable mutations at analogous sites elsewhere in the molecule. Based on the data in category (f), the following exchange groups may be identified, within which substitutions of amino acids are frequently conservative:

I small aliphatic, nonpolar or slightly polar residues--Ala, Ser, Thr (Pro, Gly)

II negatively charged residues and their amides--Asn Asp Glu Gln

III positively charged residues--His Arg Lys

IV large aliphatic nonpolar residues--Met Leu Ile Val (Cys)

V large aromatic residues--Phe Tyr Trp

Three residues are parenthesized because of their special roles in protein architecture. Gly is the only residue without a side chain and therefore imparts flexibility to the chain. Pro has an unusual geometry which tightly constrains the chain. Cys can participate in disulfide bonds which hold proteins into a particular folding. Note that Schulz and Schimer would merge I and II above. Note also that Tyr, because of its hydrogen bonding potential, has some kinship with Ser, Thr, etc.

In general, functionality is less likely to be affected by mutations at surface residues, at least those not involved in either the heme crevice or the subunit contacts. In addition, "loops" connecting alpha helices, as well as free amino or carboxy termini, are more tolerant of deletions and insertions.

A "Met FX alpha globin" is an alpha globin-like polypeptide comprising an N-terminal methionine, a oligopeptide which acts as a recognition site for Factor Xa (e.g., Ile-Glu-Gly-Arg), and an alpha globin-like sequence (e.g., Val-His-Leu-Thr-Pro . . . ) which may correspond to wild-type alpha globin or to a mutant thereof as taught herein. The term "Met FX alpha globin" is sometimes abbreviated as "FX alpha globin". "FX beta globin" is an analogously defined beta globin-like polypeptide. "Met-alpha globin" is an alpha globin-like polypeptide with an extra N-terminal methionine. The second amino acid is valine, which is the first amino acid of mature wild-type alpha globin. Met-beta globin is analogously defined. A "Des-FX alpha globin" gene (or "dFX alpha globin") is a Met-alpha globin gene obtained by excising the FX codons from a Met-FX alpha globin gene. Note that "Met-Hgb" is used to refer to methionyl Hgb formed from methionyl-alpha globin and methionyl-beta globin.

"Des-Val-alpha globin" (or "dVal alpha globin") is an alpha globin-like polypeptide wherein methionine is substituted for the valine which begins the sequence of mature wild-type alpha globin. Des-Val-beta globin is analogously defined. Des-Val-alpha/alpha globin (di-Des-Val-alpha globin) is a "di-alpha globin" in which a "Des-Val-alpha" sequence is linked via an appropriate peptidyl linker to an alpha globin-like sequence which begins with Val.

The alpha and beta globin-like chains need not correspond exactly in sequence to the alpha and beta globins of "conventional" hemoglobin. Rather, mutations may be introduced to alter the oxygen affinity or stability of the hemoglobin, or the ease of expression and assembly of the individual chains. By way of example and not limitation, several mutant hemoglobins have been prepared by the method of this invention. Guidance as to further mutations is provided, e.g., by Hoffman and Nagai, U.S. Pat. No. 5,028,588, and Ser. No. 07/443,950, incorporated by reference herein.

The term "low affinity hemoglobin-like protein" refers to a hemoglobin-like protein having a P50 which is at least 10% greater than the P₅₀ of cell free hemoglobin A₀ under the same conditions. Preferably, the protein, if used as a blood substitute, qualifies as a low affinity protein, and more preferably, its P₅₀ is closer to the P₅₀ of whole blood cells than to that of cell free hemoglobin.

The term "high affinity hemoglobin-like protein" refers to a hemoglobin-like protein having a P₅₀ which is at least 10% less than the P₅₀ of cell free hemoglobin A₀ under the same conditions.

Low and High Affinity Mutants

Low affinity mutant hemoglobins, i.e., those with "right shifted" oxygen equilibrium binding curves relative to cell-free conventional hemoglobin, have many potential uses. Most notably, mutant hemoglobins that have an oxygen affinity similar to whole red blood cells may be used as an oxygen-carrying transfusion substitute in place of donated red blood cells, eliminating the risk of infection and alleviating problems with supply. Cell-free native human hemoglobin cannot function as a transfusion substitute, among other reasons because oxygen is bound too tightly. In addition, because cell-free hemoglobin solutions do not need to be cross-matched and are expected to have a longer shelf life than whole blood, low affinity hemoglobin solutions may be widely used in situations where whole blood transfusion is not feasible, for example in an ambulance or on a battlefield. Mutant hemoglobins that have an even lower oxygen affinity than red blood cells may in fact delivery oxygen more effectively in many situations. Mutant hemoglobins that have a somewhat higher oxygen affinity than whole blood (but a lower affinity than cell-free native human hemoglobin) will still function as an adequate transfusion substitute and may in fact deliver oxygen more effectively than red blood cells in some situations. This is because oxygen is released directly to plasma form hemoglobin-based solutions, without the need to diffuse through the red cell membrane, and because cell-free hemoglobin may penetrate into regions not accessible to red blood cells. As an example, low affinity mutant hemoglobin is expected to deliver oxygen effectively during coronary artery balloon angioplasty procedures, whereas circulation of red blood cells is obstructed during such procedures. Low affinity mutant hemoglobin may also be useful as a perfusion component in organ preservation prior to transplantation or as a mammalian cell culture additive.

High affinity mutant hemoglobin may have utility in other situations. For example, perfluorocarbon-based blood substitute preparations are under clinical study for enhancement of radiation therapy and certain chemotherapy treatments of solid tumors (Dowling, S., Fischer, J. J., and Rockwell, S. (1991) Biomat. Art. Cells Immobil. Biotech, 19, 277; Herman, T. S. and Teicher, B. A. (1991) Biomat. Art. Cells and Immobil. Biotech, 19, 395; Holden, S. A., Teicher, B. A. and Herman, T. S. (1991) Biomat. Art. Cells and Immobil. Biotech, 19, 399.) The basis of these investigations is the fact that oxygen is a required component of the cell toxicity action of radiation and certain chemotherapy reagents. Solid tumors frequently exhibit extremely low partial oxygen pressure in the interior of the tumor, rendering therapy inefficient. Perfluorocarbon-based oxygen-carrying solutions appear to dramatically enhance certain tumor therapies, and hemoglobin-based blood substitutes are expected to have a similar utility. It is likely that cell-free hemoglobin unlike whole red blood cells, will be able to penetrate the interior region of tumors for delivery of oxygen. Actual percent of oxygen released by a cell-free hemoglobin preparation is not a direct function of P₅₀ but rather depends on the shape of the oxygen equilibrium binding curve between the two pressures representing the partial oxygen pressure of the lungs (where oxygen is loaded onto hemoglobin) and the partial pressure of the tissue where oxygen is unloaded. Therefore, it is possible that a high affinity mutant hemoglobin would be preferred as a tumor therapy adjuvant. A high affinity hemoglobin would retain its bound oxygen throughout the normal circulatory system, where partial oxygen pressure remains relatively high, but release its oxygen in the extremely oxygen-depleted tumor interior. Normal or low affinity hemoglobin might have less hemoglobin available for release by the time it reaches the interior of the tumor.

Possible low affinity mutants are discussed in detail, by way of example and not of limitation, in Table 1 (natural low affinity hemoglobin mutants) and Table 2 (candidate non-naturally occurring low affinity hemoglobin mutants) of Hoffman, et al., U.S. Patent 5,028,588. Low affinity mutants of particular interest are the Presbyterian (beta Lys 108) beta Phe⁶³, beta Ile⁶⁷, and Kansas (beta Thr¹⁰²) mutants.

Naturally occurring high affinity hemoglobin mutants are also known, see Bunn and Forget, Table 14-1, and candidate non-naturally occurring high affinity hemoglobin mutants may be proposed in view of the known mutants and hemoglobin structure. For example, a naturally occurring high affinity hemoglobin mutant is hemoglobin Providence having a Lys82Asn substitution in the human beta globin protein. Particularly preferred high affinity mutants are set forth in Table 400.

It should be noted that genetic fusion and crosslinking can affect oxygen binding affinity.

Cysteine Mutations and Disulfide Bridge Formation

Cysteine mutations are of value for increasing the stability of the tetramer (See U.S. Pat. No. 5,028,588 and Ser. No. 07/443,950. They also facilitate constructing poly(tetrameric) (n≧2) hemoglobins with increased intravascular half-life. This is because the cysteines on adjacent tetramers (including pseudotetramers) can be oxidized to form a disulfide bridge, covalently coupling the tetramers.

A variety of sites are available for introduction of cysteines into a hemoglobin-like protein.

The criteria governing site selection are: (1) the mutation does not affect functionality; (2) the side chain is accessible to water in oxy or deoxy structure; (3) the site should lie on the surface of the folded protein; (4) the sulfhydryl of the side chain should extend away from the surface rather than toward the interior of the molecule; (5) the site should be in a portion of the molecule that is not directly involved in the R→T transition; (6) the change should be in a portion of the molecule that does not have a tightly fixed position (such regions generally give indistinct X-ray diffraction patterns); (7) the mutations will not destroy the local secondary structure, i.e., avoid pro-→cys mutations, which might result in a refolding problem; and (8) if possible, a conservative change should be made such as ser→cys or ala>cys. A mutation does not necessarily have to meet all of the above requirements to be useful. For example, one might envision a site that is involved in the R→T transition (cf. 5 above) but confers a beneficial change in P₅₀ (cf. 1 above) because of that involvement. The most important considerations are that the mutation does not abolish O₂ binding and that the cysteine is accessible for participation in disulfide bonding.

Candidate sites on the alpha surface include:

his72, asn 78, asn68, ala71, thr67, lys7, lys11, thr8, ala12, thr118, lys16, ala45, glu116, gly15, his112, thr24, glu23, lys60, lys56, his50, gly51, glu53, ser49, asp47, gln54, his45, lys90, ala82, lys61, ala19, his20, asp85, ser81, asp75, asp74, lys139, asp64, and gly18 (total 40 amino acids).

Candidate sites on the beta surfaces includes:

asp79, his2, leu3, thr4, glu6, ser9, thr12, ala13, gly16, lys17, val18, asn19, val20, asp21, glu22, lys65, ser72, ala76, his77, asp79, asn80, gly83, ala86, thr87, glu90, lys95, lys59, glu43, ser44, asp47, ser49, thr50, ala53, asp52, lys61, glu121, lys 120, thr123, lys66, asp73, ala62, his116, his117 (total 45 amino acids).

There are a number of naturally occurring mutants which already know mutations at these sites. These are listed below:

    ______________________________________                                         Residues      Region      Mutation                                             ______________________________________                                         19            AB1         ALA-->GLU                                                                      ALA-->ASP                                            54            E3          GLN-->ARG                                                                      GLN-->GLU                                            71            E20         ALA-->GLU                                            75            EF4         ASP-->GLY                                                                      ASP-->HIS                                                                      ASP-->TYR                                                                      ASP-->ASN                                            81            F2          SER-->CYS                                            47            CE5         ASP-->GLY                                                                      ASP-->HIS                                                                      ASP-->ASN                                            ______________________________________                                    

If the pseudo-octamer (n=2) is formed by directly linking two pseudo-tetramers via a disulfide bond, the halflife in serum may be influenced by the rate at which endogenous serum small molecule thiols (such as glutathione) reduce the disulfide bond. The mechanism of these reactions involves the thiolate anion as the actual reducing species (Creighton, T. E. (1978) Prog. Biophys. Molec. Biol., 33:259-260: Creighton, T. E. (1975) J. Mol. Biol., 96:767; Creighton, T. E. (1977) J. Mol. Biol., 113:313). Thus the rate of reduction will be a function of the molecular electrostatic environment in the vicinity of the disulfide bond. A slower rate of reduction would be predicted if the disulfide was located in an electrostatically negative environment,, due to the repulsion of the thiolate anion. In the case of glutathione, even the unreactive transient protonated species has a net negative charge and would be repulsed, thus further reducing the rate of disulfide reduction.

A surface or near-surface amino acid residue of di-alpha or di-beta hemoglobin that is located in close proximity to a negatively charged surface residue might therefore be a good choice for location of a single cysteine mutation in the di-alpha or di-beta polypeptide. Although formation of the initial disulfide bond between two such cysteines might also be slower because of repulsion between the negative charges on the two hemoglobin molecules in the vicinity of the cysteines, the reaction could be facilitated by use of high salt or high pH during the in vitro bond formation reaction. If carried out under deoxy conditions in a redux buffer, the reaction might also be facilitated by temperature elevation.

Preferred Sites for cys Mutations Proximal to Negative Charged Residues

    ______________________________________                                         alpha ser49                                                                               near asp47 naturally occurring ser49 to arg                                               has normal O.sub.2 affinity                              alpha his20                                                                               near glu23 naturally occurring his20 to tyr,                                              gln, arg have no known                                                         undesirable properties                                   alpha lys16                                                                               near glu116                                                                               naturally occurring lys to glu has                                             normal O.sub.2 affinity                                  alpha his50                                                                               near glu30 naturally occurring his50 to asp                                               has no known undesirable                                                       properties                                               beta thr50 near asp52 naturally occurring thr50 to lys                                               has no known undesirable                                                       properties                                               beta lys65 near asp21                                                          beta asn19 near asp21                                                          ______________________________________                                    

Surface or near-surface cysteine mutations in general are not expected to have major effects on the functionality of the hemoglobin pseudotetramer. Cysteine mutations would not be expected to significantly destabilize alpha helices, and surface residues are not directly involved in the oxygen binding properties of hemoglobin. Most surface residues undergo considerable motion and are not tightly constrained. It should also be noted that because of protein breathing motions, the cysteine side chain would not necessarily have to point directly into solution to be accessible for disulfide bond formation.

In addition to the use in construction of a pseudooctamer, there may be additional uses of surface cysteine mutations. These include: (1) construction of multimeric hemoglobins (n>2) by use of synthetic sulfhydryl reactive peptides with more than two reactive sites; (2) surface cysteine residues could be used to attach chelates that bind radioisotopes for imaging; and (3) surface cysteines could be used to attach bio-active peptides or other therapeutic agents to increase their circulating half-life, or target their delivery. If the attachment of the drug were via a disulfide, the rate of release of the peptide from its carrier could be controlled by neighboring residues. For uses (2) and (3), restriction to one cysteine per di-alpha or di-beta is unnecessary.

Gene Construction and Expression

The DNA sequences encoding the individual alpha (or di-alpha) and beta (or di-beta) globin chains may be of genomic, cDNA and synthetic origin, or a combination thereof. Since the genomic globin genes contains introns, genomic DNA must either be expressed in a host which can properly splice the premessenger RNA or modified by excising the introns. Use of an at least partially synthetic gene is preferable for several reasons. First, the codons encoding the desired amino acids may be selected with a view to providing unique or nearly unique restriction sites at convenient points in the sequence, thus facilitating rapid alteration of the sequence by cassette mutagenesis. Second, the codon selection may be made to optimize expression in a selected host. For codon preferences in E. coli, see Konigsberg, et al., PNAS, 80:687-91 (1983). For codon preferences in yeast, see the next section. Finally, secondary structures formed by the messenger RNA transcript may interfere with transcription or translation. If so, these secondary structures may be eliminated by altering the codon selections.

Of course, if a linker is used to genetically crosslink subunits, the linker will normally be encoded by a synthetic DNA. While the di-alpha globin and the beta globin may be expressed separately and then combined with each other and heme in vitro, they are preferably placed on one plasmid.

The present invention is not limited to the use of any particular host cell, vector, or promoter. The host cell may be a microbial, plant or animal cell, and, if a plant or animal, it may be in cell, tissue or organ culture, or a cell of a transgenic or chimeric plant or animal. However, the preferred host cells are bacterial (especially, E. coli) and yeast (especially S. cerevisiae) cells. The promoter selected must be functional in the desired host cells. It preferably is an inducible promoter which, upon induction, provides a high rate of transcription. A preferred bacterial promoter is the Tac promoter, a trp/lac hybrid described fully in DeBoer, U.S. Pat. No. 4,551,433 and commercially available from Pharmacia-LKB. Other promoters which might be used include the temperature sensitive lambda P_(L) and P_(R) promoters, as well as the lac, trp, trc, pIN (lipoprotein promoter and lac operator hybrid), gal and heat shock promoters. The promoter used need not be identical to any naturally-occurring promoter. Guidance for the design of promoters is provided by studies of promoter structure such as that of Harley and Reynolds, Nucleic Acids Res., 15:2343-61 (1987) and papers cited therein. The location of the promoter relative to the first structural gene may be optimized. See Roberts, et al., PNAS (USA), 76:760-4 (1979). The use of a single promoter is favored. Suitable yeast expression systems are described in detail elsewhere in this specification.

The vector used must be one having an origin of replication which is functional in the host cell. It desirably also has unique restriction sites for insertion of the globin genes and the desired regulatory elements and a conventional selectable marker. A vector may be modified to introduce or eliminate restriction sites to make it more suitable for further manipulations.

The alpha and beta globin chains may be expressed either directly or as part of fusion proteins. When expressed as fusion proteins, the latter may include a site at which they may be cleaved to release the alpha and beta globin free of extraneous polypeptide. If so, a site sensitive to the enzyme Factor Xa may be provided, as taught in Nagai and Thorgenson, EP Appl 161,937, incorporated by references herein. Alternatively, the alpha and beta fusion proteins may be synthesized, folded and heme incorporated to yield a hemoglobin analogue.

The direct expression of the alpha and beta globin subunits is desirable. Factor Xa is a blood derivative. Preparations of Factor Xa may therefore contain undesirable blood-associated substances or etiologic agents. In any event, the hemoglobin must be separated from the Factor Xa.

Nagai and Thorgerson, EP Appl 161,937, incorporated by reference herein, teach the construction of fused genes in which DNA coding for a polypeptide of interest is immediately preceded by DNA encoding a cleavage site for Factor Xa, a serine protease. Certain of the peptide sequences to be cleaved by Factor Xa are quoted below (wherein the cleavage site is denoted by an "="):

Ile-Glu-Gly-Arg=Val-His-Leu-Thr CII Fxβ-globin

Ile-Glu-Gly-Arg=Thr-Ala-Thr-Ser Hu prothrombin

Ile-Glu-Gly-Arg=Thr-Ser-Glu-Asp Bo prothrombin

Ile-Asp-Gly-Arg=Ile-Val-Glu-Gly Hu prothrombin

Ile-Glu-Gly-Arg=Ile-Val-Glu-Gly Bo prothrombin

Ala-Glu-Gly-Arg=Asp-Asp-Leu-Tyr Hu antithrombin III

In the above list, "CIIFXβ-globin" refers to a hybrid fusion protein comprising the 31 amino-terminal residues of the lambdaCII protein, the Factor Xa recognition sequence "Ile-Glu-Gly-Arg," and the complete amino acid sequence of human beta globin (which begins "Val-His-Leu-Thr- . . . "). It will be evident from study of FIG. 4 of the present invention that FX-alpha and FX-beta globins of Example 1 correspond to the native globin preceded by "Met-Ile-Glu-Gly-Arg."

In bacterial mRNA, the site at which the ribosome binds to the messenger is a polypurine stretch which lies 4-7 bases upstream of the start (AUG) codon. The consensus sequence of this stretch is 5'. . . AGGAGG . . .3', and is frequently referred to as the Shine-Dalgarno sequence. Shine and Dalgarno, Nature, 254: 34 (1975). The exact distance between the SD sequence and the translational start codon, and the base sequence of this "spacer" region, affect the efficiency of translation and may be optimized empirically. Shepard, et al., DNA 1: 125 (1985); DeBoer, et al., DNA 2: 231 (1983); Hui, et al., EMBO J., 3: 623 (1984).

In addition, the SD sequence may itself be modified to alter expression. Hui and DeBoer, PNAS (USA), 84:4762-66 (1987-). Comparative studies of ribosomal binding sites, such as the study of Scherer, et al., Nucleic Acids Res., 8:3895-3907 (1987), may provide guidance as to suitable base changes. If the hemoglobin is to be expressed in a host other than E. coli, a ribosomal-binding site preferred by that host should be provided. Zaghbil and Doi, J. Bacteriol., 168:1033-35 (1986).

Any host may be used which recognizes the selected promoter and ribosomal binding site and which has the capability of synthesizing and incorporating heme. Bacterial and yeast hosts are preferred.

The intracellularly assembled hemoglobin may be recovered from the producing cells and purified by any art-recognized technique.

Polycistronic Co-Expression of Alpha and Beta Globins and Their Assembly into Hemoglobin

In one embodiment, expression of the alpha and beta globin genes is driven by a single promoter, and the genes are arranged so that a polycistronic messenger RNA transcript is transcribed, from which the separate alpha and beta globin polypeptides are subsequently translated. However, the present invention includes the co-expression of the alpha and beta globin genes from separate promoters, i.e., the host transcribes separate alpha and beta globin mRNAs.

The use of a single promoter is favored on theoretical grounds. Ideally, alpha and beta globin are expressed in stoichiometrically equal amounts. While use of a single promoter does not guarantee equality, in eliminates one unbalancing influence--differences in transcription owing to differences in promoter strength and accessibility. If differences in promoter strength were minimized by use of two identical promoters on the same plasmid, plasmid stability would be reduced as there would be a propensity toward recombination of the homologous regions. We note, however, that in preliminary experiments we have co-expressed alpha and beta globins from separate promoters.

Another justification for using a single promoter is to minimize the number of repressor binding sites.

Preferably, the alpha and beta globin genes are arranged so that the ribosome will translate the alpha globin cistron first. The rationale is that there is some basis for believing that alpha globin affects the folding of beta globin. Nonetheless, the position of the genes may be switched so that beta globin is synthesized first, as is shown in Example 6.

The stability of the polycistronic mRNA transcript, the efficacy of its translation into alpha and beta globin, and the folding of the globin chains into tetrameric hemoglobin may be modified by varying the length and base sequence of the intercistronic regions (the region lying between the stop codon of one cistron and the start codon of the next cistron), the phasing of a second cistron relative to a first cistron, and the position and sequence of the ribosomal binding site for the one cistron relative to the preceding cistron.

In a preferred embodiment, the alpha and beta globin genes are each preceded by a short "introductory" cistron or "ribosomal loader" which facilities the subsequent translation of the globin cistron. In FIG. 4, region A contains two cistrons and a Shine-Delgarno sequence preceeding each cistron. The first Shine-Delgarno sequence (SD#1) is bound by the ribosome, which then translates the first cistron, a short cistron encoding an octapeptide. (This cistron is referred to as an "introductory cistron or ribosomal loader.) The second cistron is a globin gene, in this case, an FX alpha-globin gene. The Shine-Delgarno sequence (SD#2) for facilitating translation of the second cistron actually lies within the first cistron. For this reason, the two are said to be "translationally coupled". Region B is identical in structure, except that the second cistron encodes FX-beta globin. Between regions A and B is a 43-base intercistronic region. The introductory cistrons of regions A and B correspond to the first cistron of the two-cistron expression system denoted pCZ144 in Schoner, et al., Meth. Enzymol., 153: 401-16 (1987). The present invention is not, however, limited to the particular "starter" cistron taught by Schoner, et al.; other introductory cistrons that allow for restart of high level translation of a following cistron may be used.

Guidance as to the design of intercistronic sequences and as to the location of SD sequences may be obtained by comparing the translational efficiency of spontaneous or controlled mutants of the same polycistronic operon, as exemplified by Schoner, et al., PNAS, 83: 8506-10 (1980). It is also possible to look for consensus features in the intercistronic regions of different operons. McCarthy, et al., EMBO J., 4: 519-26 (1985) have identified a translation-enhancing intercistronic sequence in the E. coli atp operon.

The present invention is intended to reduce or avoid the localization of the hemoglobin or its component polypeptides into inclusion bodies. Consequently, a further feature of the invention is that the functional hemoglobin is substantially found (preferably over 80%) in the soluble fraction of the cell. It appears that with this invention, over 90% of the functional hemoglobin can be so directed when alpha₂ beta₂ hemoglobin is assembled from alpha- and beta-globin chains co-expressed from a tetracistronic operon as described herein. With di-alpha, beta₂ hemoglobin, nearly 100% is soluble when expression is induced at 25° C. and less at higher induction temperatures. These percentages reflect the percent of all di-alpha and beta chains found in the soluble fraction of the cell and not actual recovery of protein from the cell.

Expression in Yeast

In another embodiment the present invention relates to the production of hemoglobin-like molecules in yeast. Our preferred host for expression of recombinant human hemoglobin in yeast is Saccharomyces cerevisiae. However, other fungi or yeast may be used for the purpose, such as strains of Aspergillus or Pichia. For yeast to be a suitable host it must be capable of being transformed with recombinant vectors, either replicating or integrating types. This allows the insertion of the desired DNA sequence for the gene of interest. It must also be capable of high density cell growth, in appropriate volume to provide sufficient cell mass to isolate the desired gene product from the desired reaction vessels, where ideally the growth would be easily controlled by several parameters including nutrient formulation, agitation and oxygen transfer and temperature. It is also desirable to be able to induce the expression of protein synthesis with the manipulation of the media, temperature, or by the addition or consumption of certain chemicals. Finally, to be a suitable host, the yeast must be capable of producing recombinant proteins, preferably in excess of 1% of the total cell protein. This allows more facile isolation of the desired recombinant protein.

With reference to S. cerevisiae, haploid strains of potential use include:

    ______________________________________                                         BJY3501  Matα pep4::HIS3 prb1-D 1.6R his3 200 ura3-52                             can1 GAL,                                                             GSY112   As above but Leu2::HISG                                               GSY112 cir°                                                                      As above but cured of 2μ plasmid                                   BJY3505  Mata pep4::HIS3 prb1-D 1.6R HIS3 lys2-208                                      trp1-101 ura3-52 gal2 can1                                            GSY113   As above but leu2::HISG                                               RSY330   Matα pep4-3 prb1-1122 hist7 ura3-52 trp1-289                             can1 gal1                                                             BJY2168  Mata prc1-407 prb1-1122 pep4-3 leu2 trp1 ura3-52                      BJY1991  Matα prb1-1122 pep4-3 leu2 trp1 ura3-52                         RSY334   Matα reg1-501 pep4-3 prb1-1122 ura3-52 leu2-3,                           112 gal1                                                              RSY214   Matα pep4-3 prb1 ura3-52                                        ______________________________________                                    

To date, strains such as GSY112, GSY113 and RSY334 have been the best hemoglobin producers. Strains such as RSY334 that carry the reg1-501 mutation may be particularly important as they uncouple glucose repression from galactose induction, allowing one to induce with lower levels of galactose in the presence of glucose. Because this strain carries the gall mutation it cannot metabolize galactose, so the galactose concentration remains constant and continues to function as a inducer.

Diploid strains formed from crosses of any of the above compatible strains or similar compatible strains may also be useful as they tend to have faster growth rates than haploid strains.

For example, the following diploid strains, which co-express alpha (or di-alpha) and beta globins, are described in the Examples:

BJY3505 pGS4988! × RSY330 pGS4688!

BJY3505 pGS4988! × BJY 1991 pGS4688!

Other matings may likewise be used in practicing the present invention.

The use of protease-deficient strains may also be advantageous.

Yeast expression systems can be divided into two main categories: (1) Systems designed to secrete protein and (2) system designed for the cytoplasmic expression of proteins. The advantages of secretion systems are:

(1) The protein is often easier to purify from culture medium than from total cell extracts.

(2) If the proteins has essential disulfide bonds, they are more likely to form if the protein passes through the secretory pathway. This is thought to be partly due to the presence of protein disulfide isomerase in the endoplasmic reticulum and to a less reducing environment than in the cytoplasm.

(3) Secretion can also be advantageous if the first amino acid (after the initiating methionine) creates a dipeptide sequence that is poorly processed by methionyl aminopeptidase. The addiction of a secretory signal sequence may allow processing by signal peptidase during secretion resulting in a protein with the authentic amino acid at the amino terminus of the protein.

The disadvantages of a secretion system are:

(1) Generally, the expression level are much lower than those that can be obtained by cytoplasmic expression. This seems to be, in part, due to a rate limiting step in secretion.

(2) Not all proteins are secretable.

(3) Often, particularly with S. cerevisiae, misprocessed forms of the protein accumulate and these can be difficult to purify away from correctly processed forms.

The secretory expression system most commonly used in yeast is the recombinant α-factor secretory signal sequences and the α-factor promoter. See A. J. Brake. Yeast Genetic Engineering Eds. P. Barr, A. Brake, and P. Valenzuela. Butterworth Publishing, Boston 1989, for a review. The invertase signal sequence coupled to a variety of promoters has also been used to express heterologous proteins in yeast. See G. Stetler et al., Biotechnology, 7:55-60 (1989), R. Smith et al., Science, 229:1219-1229 (1985).

The advantages of cytoplasmic expression are:

(1) Expression levels can be quite high with reports of proteins expressed at >20% of the total cell protein, usually as a soluble protein.

(2) Some proteins cannot be efficiently secreted.

(3) Proteins that contain glycosylation sites, if secreted from yeast, will be glycosylated with the pattern of sugar s unique to yeast. Because these highly hydrophilic, post-translational modifications lie on the surface of proteins they are likely to confer antigenicity on the protein. If the protein is functional without the carbohydrate side chains it may be advantageous to produce the protein without them rather than with the yeast-specific modification (see for ex. Travis et al. 1985. J Biol Chem 260:4384-4389).

(4) Some proteins have other specific modifications that may occur only in the cytoplasm, for example aminoterminal acetylation, modification with lipids, perhaps heme incorporation and so forth.

At present, cytoplasmic expression is preferred since the yeast cells fold together the globin chains and incorporate heme to produce hemoglobin in vivo. However, it is possible to separately express and secrete the alpha and beta globin chains and assemble hemoglobin in vitro.

The globin genes must be placed under the control of a suitable promoter. The commonly used yeast promoters generally fall into two broad categories: regulated and constitutive. Constitutive promoters that are in wide use include GAP, PGK (phosphoglycerate kinase) and the α-factor promoter. Regulated promoters have also been used and these include the yeast metallothionein promoter (regulated by copper), the Gal-10 promoter, GAL7 promoter (regulated by galactose and glucose) the ADHII promoter (regulated by ethanol and glucose) the PHO5 promoter (phosphate regulation) and several hybrid promoters such as PHO5-GAP, GAL-PGK, ADHII-GAP, and GAL-CYC1.

The use of regulated promoters may be important for plasmid stability. Often expression of recombinant proteins, at high levels, inhibits the growth of the host organism. This disadvantage can result in the accumulation of cells in the population with few copies of the plasmid present, a decrease in growth rate and a drop in overall expression levels. By maintaining the gene in a repressed state and inducing late in the fermentation, high growth rates can be maintained and selection against high plasmid copy numbers can be avoided.

It is somewhat difficult to obtain accurate data on the relative strength of yeast promoters because the only true measure of this would be based on kinetics of mRNA synthesis. Most of the data that exists measures only the final protein concentration that has been obtained. Because there can be great differences in protein stability, it is necessary to compare the same protein and mRNA with multiple promoters on identical vectors. A study that approaches this was done by Verbakel et al. (Gene, 61:207-215 (1987)). They compared the GAPDH, CYC1 GAL7, PHO5, and PGK promoters and measured the expression of a β-galactosidase/poliovirus VP2 protein fusion. CYC1 resulted in an expression level of 0.14% of the total cell protein (TCP); GAPDH, 0.22% TCP; PHO5, 0.26% TCP; PGK, 0.9% TCP; and GAL7, 0.96% TCP.

Using our GALGAP hybrid promoter, we have obtained expression levels of hemoglobin that represents ≧15% of the total cell protein. This most probably represents an increased promoter strength along with a highly stable protein product and perhaps MRNA with an extended half life.

The use of a GAL-GAP hybrid promoter is preferred. Both elements (the GAL_(UAS) and the GAP transcriptional initiation site) are well understood. Studies on the mechanisms of transcriptional regulation of the GAL regulon have been fairly extensive. The galactose regulon includes five genes that encode enzymes required for the utilization of galactose. Four of these genes (GAL1, GAL7, GAL10, and GAL2) are expressed only in the presence of galactose. Galactose induction does not occur in the presence of glucose unless the yeast strain bears a mutation in the REG1 gene. The GAL1, 7, 10 and 2 genes are regulated by at least two other genes, GAL80 and GAL4. The GAL4 gene is a transcriptional activator protein that activates mRNA synthesis from the GAL1, 7, 10 and 2 upstream activator sequences (UASGAL). Although GAL4 is constitutively expressed, it is functionally silent in the absence of galactose. Repression of GAL4 activity, in the absence of galactose is maintained by the product of the GAL80 gene. The GAL80 protein apparently interacts physically with GAL4 to prevent transcriptional activation. Presumably galactose or a galactose derivative prevents this interaction to allow GAL4 mediated induction.

Haploid strains of S. cerevisiae have three different genes encoding the enzyme glyceraldehyde-3-phosphate dehydrogenase (GAP). These genes have been designated TDH1, TDH2 and TDH3 and each is present as a single copy per haploid genome. The TDH3 gene produces approximately 60% of the cell's GAP enzyme and TDH1 and 2 produce about 12% and 28%, respectively (McAllister, L and M. J. Holland, 1985. J. Biol Chem, 260: 15019-15027). Holland's group (Holland et al. 1981. J. Biol Chem, 256:1385-1395; and Holland et al. 1983. J Biol Chem 258:5291-5299) has cloned and characterized the three GAP genes of S. cerevisiae. The clones have been designated pGAP11, pGAP63, and pGAP491. pGAP491 corresponds to the TDH3 gene and is therefore, the most highly expressed. This promoter has been used to express a wide variety of proteins in yeast including:

    ______________________________________                                         Protein          REF                                                           ______________________________________                                         α Interferon                                                                              (1)                                                           Hepatitis B antigen                                                                             (1)                                                           Thaumatin        (2)                                                           Hepatitis B antigen                                                                             (3)                                                           HIV III reverse  (4)                                                           transcriptase                                                                  human SOC        (5)                                                           α1 antiprotease                                                                           (6)                                                           ______________________________________                                          (1) Bitter, G. and K. M. Eagan. 1984. Gene, 32.263-274.                        (2) Edens, L. et al. 1984. Cell, 37:629-633.                                   (3) Kitano, K. et al. 1987. Biotechnology 5:281-283.                           (4) Hallewell, R. A. et al. 1987. Biotechnology 5:363-366.                     (5) Barr, P. J. et al. 1987. Biotechnology 5:486-489.                          (6) Travis, J. et al. 1985. J Biol Chem 260:4384-4389.                   

This promoter is commonly used as a 600-850 bp fragment and is essentially un-regulated. In its long form this is a very powerful promoter. The form we are using consists of only ⁻⁻ 200 bp 5' of the translational initiation site. This form, with no added enhancer sequences is substantially less active than the longer form of the promoter (Edens, L. et al. Cell, 37:629 (1984)). Our addition of the GAL enhancer region confers both regulation and high levels of expression. With only the GAP491 promoter, alpha and beta globin were produced at a level of less than 0.2% total cell protein; with the GAL-GAP491 hybrid promoter, expression jumped to 7-10% total cell protein.

Several other hybrid promoters are of particular interest:

GAL-SIGMA

A strong galactose-regulated promoter with the sigma transcriptional start site.

SIGMA-GAP

A strong peptide hormone-regulated promoter with the GAP491 transcriptional start site.

GAL-EF III

A strong galactose-regulated promoter with the elongation factor III transcriptional start site.

SIGMA-EF III

A strong peptide hormone-regulated promoter with the elongation factor III transcriptional start site.

One could easily conceive of other promoter systems that would also work. This would include, but not be limited to, a variety of constitutive promoters. For example, the yeast mating factors (MFα) promoter or the mating factor a promoter MF(a), the phosphoglycerate kinase promoter (PGK), hexokinasel, hexokinase2, glucokinase, pyruvate kinase, triose phosphate isomerase, phosphoglycerate isomerase, phosphoglycerate mutase, phosphofructose kinase or aldolase promoters may all be used. In short, any well expressed yeast promoter may work for expression of hemoglobin in yeast. A wide variety of naturally occurring, regulated promoters could also be used, for example: GAL1-10, GAL7, PHO5, ADHII have all been used to produce heterologous proteins in yeast. A variety of synthetic or semi-synthetic yeast promoters could also be employed such as GAL-PGK, GAL-MFα-1, GAL-MFa1, GAL-SIGMA. ADHII regulatory sequences could also be coupled to strong transcriptional initiation sites derived from a variety of promoters. The PHO5 regulatory sequence or the sigma element regulatory sequences could also be used to construct powerful hybrid promoters. In addition to yeast promoters, it is conceivable that one could use a powerful prokaryotic promoter like the T7 promoter. In this case, one could place the T7 polymerase under the control of a tightly regulated yeast promoter. Induction of the phage polymerase in yeast cells bearing hemoglobin genes under T7 promoter regulation would allow transcription of the genes by this very efficient phage polymerase.

Because most of the yeast regulatory sequences described above serve as targets for proteins that are positive regulators of transcription, it is conceivable that these proteins may limit transcription in situations where the target sequence is present in many copies. Such a situation may obtain with vectors such as pC1B, pCIT, pC1U or pC1N which may be present in excess of 200 copies per cell. Over-expression of the positive regulator (for example GAL4) may result in enhanced expression. It is possible to construct a strain in which the GAL4 gene is altered to remove its promoter and the promoter replaced with the GAL7 or GAL1-10 promoters, both of which are transcribed more efficiently than the GAL4 promoter. In this situation, the positive transcriptional activator protein GAL4 would be expressed at elevated level at the time hemoglobin expression was induced.

The consensus sequence for higher eukaryotic ribosome binding sites has been defined by Kozack (Cell, 44:283-292 (1986)) to be: GAAGCCAUGG. Deviations from this sequences, particularly at the -3 position (A or G), have a large effect on translation of a particular mRNA. Virtually all highly expressed mammalian genes use this sequence. Highly expressed yeast mRNAs, on the other hand, differ from this sequence and instead use the sequence AAAAAUGU (Cigan and Donahue, Gene, 59:1-18 (1987)). The ribosome binding site that we use for expression of the α and β-globins corresponds to the higher eukaryotic ribosome binding site. It is within the contemplation of this invention to systematically alter this RBS to test the effects of changes that make it more closely resemble the RBS of yeast. It should be pointed out, however, that alterations at the -2, -1 and +3 positions, in general, have been found to only slightly affect translational efficiency in yeast and in mammals.

Intracellular expression of genes in S. cerevisiae is primarily affected by the strength of the promoter associated with the gene, the plasmid copy number (for plasmid-borne genes), the transcription terminator, the host strain, and the codon preference pattern of the gene. When secretion of the gene product is desired, the secretion leader sequence becomes significant. It should be noted that with multicopy plasmids, secretion efficiency may be reduced by strong promoter constructions. Ernst, DNA 5:483-491 (1986).

A variety of extrachromosomally replicating vectors (plasmids) are available for transforming yeast cells. The most useful multicopy extrachromosomal yeast vectors are shuttle vectors that use a full length 2μ-circle combined with an E. coli plasmid. These vectors carry genes that allows one to maintain the plasmid in appropriate yeast mutants and antibiotic resistance markers that allow selection in E. coli. Use of the full-length 2μ-circle, in contrast to vectors containing only a partial 2μ sequence, generally results in much higher plasmid stability, particularly in yeast strains that have been cured of endogenous 2μ plasmid. The pC series of vectors described herein are vectors of this type.

Strains could also be constructed in such a way that the GALGAP hemoglobin expression cassettes were integrated into chromosomes by using yeast integrating vectors. Although the copy number of the hemoglobin genes would be lower than for plasmid vectors, they would be quite stable and perhaps not require selection to be maintained in the host cell. Yeast integrating vectors include Yip5 (Struhl, et al, PNAS, 76:1035-39, 1989), Yip1 (Id.), and pGT6 (Tchumper and Carbon, Gene, 10:157-166, 1980). For information on these and other yeast vectors, see Pouwels, et al., Cloning Vector, VI-I, et sea. (Elsevier, 1985).

The alpha and beta globin genes may be introduced by separate plasmids, or both upon the same plasmid. The advantage of a single plasmid system over a double plasmid system is theoretical. It is generally thought that there is an upper limit to the total number of plasmid copies per cell. If it is 1000, for example, the two plasmid system could have only 500 copies of α-chain plasmid and 500 of the β-chain plasmid. A single plasmid of 1000 copies per cell would bear 1000 copies of each α- and β-chain gene. The number of copies may be irrelevant, however, if other factors are limiting. In fact, several groups favor using strains that contain genes integrated into various chromosomal loci. Such strains very stably maintain the foreign gene and do not require special mediα to maintain selection for the plasmid.

Highly expressed yeast genes show a very high codon bias. The genes encoding glyceraldehyde-3-phosphate dehydrogenase and ADH-I, for example, show a 90% bias for a set of 25 codons. Highly expressed yeast genes (>1% of the total mRNA) have yeast codon bias indices of >.90. Moderately expressed genes (0.1-.05% of the total mRNA) have bias indices of 0.6-0.8, and genes expressed at low levels (>0.05% of the total cell protein) have a codon bias of 0.10-0.50 (Bennetzen and Hall, J. Biol. Chem., 257:3026-3031. (1982)). The calculated value for the codons of the human α-globin cDNA is 0.23. A similar value can be calculated for the β-globin cDNA. Because there is a very high correlation between the most commonly used codons, it is possible that hemoglobin expression from the human cDNA in yeast may be limited by the availability of the appropriate tRNA molecules. If this is so, a complete synthesis of the gene using the most highly favored yeast codons could improve the expression levels. It is quite possible that the greatest negative effect of adverse codon use would be if there was an abundance of codons used in the cDNA that are represented by low abundance tRNAs. In such a case, high level expression of hemoglobin could completely drain that pool of tRNA molecules, reducing translation not only of hemoglobin but of yeast proteins that happen to use that codon as well. In the case of the α-globin human cDNA, the most commonly used leucine codon is CTG (14 of 21), this codon is never used in highly expressed yeast genes (Guthrie and Abelson, The Molecular Biology of the Yeast Saccharomyces, Eds. Stratern, Jones and Broach, 1982. Cold Spring Harbor, N.Y.). The low codon bias index and the presence of rare yeast codons in the globin cDNAs have been sufficient incentive for us to synthesize a modified form of the alpha- and beta-globin genes using the preferred yeast codons.

Pseudodimer-Containing Hemoglobins

Although the assembly of secreted F_(V) antibody fragments in which the V_(H) and V_(L) domains are fused by a 16 amino acid peptide has been demonstrated in E. coli, Better, J., Chang, P. Robinson, R., Horwitz, R., Science 240, 1041-43 (1988), Skerra, A. and Pluckthorn, A., Science 240, 1038-41 (1988), there are no previous examples of the cytoplasmic folding and assembly of a mammalian heterotetramer in which two of the subunits are fused to one another.

The present invention further contemplates in some embodiments the combination of (a) one molecule of a di-alpha globin-like polypeptide with two molecules of a beta globin-like polypeptide to form a "di-alpha" hemoglobin-like protein; (b) two molecules of an alpha-globin-like polypeptide with one molecule of a di-beta globin-like polypeptide to form a "di-beta" hemoglobin-like protein; or (c) one molecule of a di-alpha globin-like polypeptide with one molecule of a di-beta globin-like polypeptide to form a "di-alpha/di-beta" hemoglobin-like protein.

It should further be noted that the delta, gamma and epsilon chains have considerable homology with the beta chain and that the zeta chain has considerable homology with the alpha chain. Di-delta, di-gamma, di-epsilon and di-zeta polypeptides are therefore within the compass of the invention and may be used in the preparation of novel hemoglobins of types other than Hgb A1.

In the liganded form, hemoglobin readily dissociates into αβ dimers which are small enough to pass through the renal glomeruli, and hemoglobin is thereby rapidly removed from the circulatory system. Intravenous administration of hemoglobin in amounts far less than that needed to support oxygen transport can result in long term kidney damage or failure. Ackers, G. K. and Halvorson, H. R., Proc. Nat. Acad. Sci. (USA) 71, 4312-16 (1974); Bunn, H. F., Jandl, J., J. Exp. Med. 129, 925-34 (1969). If dissociation into dimers is prevented, there is an increase in intravascular half life and a substantial reduction of renal toxicity. Lee, R., Atsumi, N., Jackbs, E., Austen, W., Vlahakes, G., J. Surg. Res. 47, 407-11 (1989). The pseudotetrameric hemoglobin of this invention cannot dissociate into αβ-dimers without the breakage of a peptide bond and should have the advantages of a longer intravascular half life and reduce renal toxicity.

In the crystal structures of both deoxyhemoglobin and oxyhemoglobin the N-terminal Val residue for one α subunit and the C-terminal Arg residue of the other α subunit are only between 2 and 6Å apart, and are bound to one another through a salt bridge in deoxyhemoglobin. Fermi, G., Perutz, M., Shaanan, B., Fourme, R., J. Mol. Biol., 175, 159-74 (1984); Shaanan, B., J. Mol. Biol. 171, 31-59 (1983). This distance could be spanned by one or two amino acids. One extra amino acid can be added to the C-terminal Arg residue of the a subunits by trypsin catalyzed reverse hydrolysis without significantly altering the oxygen binding properties. Nagai, K., Enoki, Y., Tomita, S. and Teshima, T., J. Biol. Chem., 257, 1622-25 (1982) Preferably the di-alpha linker (if one is used) consists of 1 to 5 amino acids which may be the same or different. A Mono-Gly linker is especially preferred. In designing such a linker, it is important to recognize that it is desirable to use one or more amino acids that will flexibly connect the two subunits, transforming them into domains of a single di-alpha globin polypeptide.

The preparation of "di-beta" mutants is also contemplated. The distance between the N-terminus of one beta subunit and the C-terminus of the other is 18.4 Å in the deoxy configuration and 5.2 Å in the oxy form. Preferably, the di-beta linker consists of 2 to 9, amino acids which may be the same or different. Glycine amino acids are particularly preferred.

The length of the (-gly-)_(n) genetically fused link between the N-terminus of one beta chain (at beta₁, 1 Val) and the C terminus of the second beta chain (beta₂, 146 His) in di-beta hemoglobin may range between 1 and approximately 9 glycines. In the oxy and deoxy crystal structures of human hemoglobin A₀, the distance between these termini is 5.22 Å and 17.93 Å respectively (from the N-terminal nitrogen to the C terminal carbon of the carboxylate). A single glycine linker, which is a little less than 4 Å in length, may come close to linking the two termini in the oxy structure, however, it is expected that this linker will fall .sup.˜ 14 Å short in the deoxy structure. Significantly more perturbation of the deoxy structure vs the oxy structure might be anticipated with this linker. Some alterations in the oxygen binding properties may be caused by deletion of the positive and negative charges at the two termini and their inclusion in the amide bond. In addition, the linker molecule itself may destabilize the oxy structure less than the deoxy structure, and thus lead to a. relative increase in oxygen affinity. Likewise, two glycines inserted as linkers may also differentially stabilize the oxy structure and hence relatively increase the oxygen affinity by t he same mechanism described above.

When the number of linking glycines is increased to 5, the linker should just span the cleft between the beta chain termini in the deoxy structure, and, moreover, insert added steric bulk between the termini in the oxy structure, thus leading to a relative stabilization of deoxy (or destabilization of oxy) and perhaps resulting in a concomitant decrease in oxygen affinity. Due to the large space between the beta termini in the deoxy (but not the oxy structure), addition of glycine linkers in the range of 6-9 may further destabilize the oxy structure and, in the same manner, further decrease oxygen affinity.

A third form of globin pseudodimer is one comprising both alpha and beta globin domains. A possible route to fusing alpha1 to beta2 and so stabilizing hemoglobin against α₁ β₁ /α₂ β₂ dimer formation, is to fuse the alpha1 C-terminal residue to the N-terminal residue of beta2 C helix, creating a new C-terminus at the end of the beta2 B helix. The original beta N terminus, Val1, would be fused to the original beta subunit C-terminal residue, His146, by means of an intervening new section of protein, thus creating a continuous polypeptide chain comprising the alpha and beta subunits of different dimers. This chain may be described as follows: α(1-14)-Gly₃ -β(35-46)-Gly₁₋₃ -Ala₁₃ -Gly₁₋₃ -β(1-34); See FIG. 36.

Inspection of the structure of human deoxyhemoglobin using. a molecular graphics computer indicates the following relevant distances. The distance between the Alpha1 Arg141 carboxyl carbon and Beta2 Tyr35N atoms is approximately 8.6 Angstroms. A fully extended linear triglycine peptide measured approximately 10.1 Angstroms from the N to C terminal residues. This suggests that three glycine residues could be employed to span the distance between the Arg141 and Tyr35 residues with a minimum of unfavorable steric interactions and maximum conformational freedom. The distance requirements could be different in oxyhemoglobin, and if so, the sequence of the fusion peptide could be altered to best accommodate the requirements of both structures.

In human deoxyhemoglobin, the distance between the Beta2 His146 carboxyl carbon and the Beta2 Val1 nitrogen atoms is approximately 25 Angstroms. A right handed 3.6 Alpha helix constructed from a linear sequence of 13 Alanine residues was found to measure 22 Angstroms from N to C terminus. With the addition of one to three glycine residues at each end of this helix (to give Gly_(n) (Ala)₁₃ Gly_(n) where n=1 to 3), it could span the required distance and have sufficient conformational flexibility to avoid serious tertiary packing conflicts. Additionally, the amino acid sequence of the helix could be altered to introduce favorable hydrogen bonds and salt bridges between the new helix and the Beta2 helix against which it would pack in the folded protein. Such interactions could aid stabilization of the engineered protein.

Glycine is the preferred amino acid in the linkers, since it is known to be quite flexible, Cantor and Schimmel, Biophysical Chemistry, part 1, pp. 266-9 (1980), and also allows chains into which it is incorporated to assume a more compact structure. However, the residues comprising the linker are not limited to glycines; other residues may be included instead of or in addition to glycine, such as alanine, serine, or threonine. Since these amino acids have a more restricted conformational space in a protein, they will likely result in more rigid linking chains, and hence have a more pronounced effect on the relative stabilization/destabilization of the oxy/deoxy structures.

It should be understood that the minimum and maximum number of amino acids in the linker is a function of the distance to be spanned in both the oxy or deoxy forms, the amino acids chosen, and the propensity of the particular amino acid sequence to form a secondary structure. While a random coil is usually preferred, it is not required, and a linker with a larger number of amino acids in a secondary structure may have the same span as a random coil linker with fewer amino acids. A linker may comprise, e.g., 1-3 glycines, followed by a sequence having a secondary structure, followed by 1-3 more glycines. The translation per residue, in angstroms is 1.9 for polyproline I, 3.12 for polyproline II, 3.1 for polyglycine II, 3.4 for an antiparallel β sheet, 3.2 for a parallel β-sheet, 1.5 for a right handed α-helix, 2.0 for a 310 helix, and 1.15 for a helix. In a fully extended chain, the maximum translation per residue is 3.63 Å if the repeating units are staggered and 3.8 Å if the peptide bond is trans.

The number of amino acids in the linker may be such that a formation of a secondary structure, such as an alpha helix or a beta-sheet, is undesirable, as the span is reduced. Certain amino acids have a greater tendency to participate in such structures. See Chou and Fasman, Biochemistry, 13:222-245 (1974), incorporated by reference. The amino acids are ranked in order of decreasing participation below. The preferred linker amino acids are boldfaced. Glycine is the most suitable amino acid for this purpose. The most preferred di-alpha linkers are Gly or Gly-Gly.

    ______________________________________                                         Alpha Helix          Beta Sheet                                                Formers              Formers                                                   ______________________________________                                         Glu (1.53)           Met (1.67)                                                Ala (1.45)           Val (1.65)                                                Leu (1.34) Hα  Ile (1.60) Hβ                                        His (1.24)           Cys (1.30)                                                Met (1.20)           Tyr (1.29)                                                Gln (1.17)           Phe (1.28)                                                Val (1.14)           Gln (1.23)                                                Trp (1.14)           Leu (1.22)                                                Phe (1.12) hα  Thr (1.20)                                                Lys (1.07)           Trp (1.19) hβ                                        Ile (1.00)           Ala (0.97) Iβ                                        Asp (0.98)           Arg (0.90)                                                Thr (0.82)           Gly (0.81)                                                Arg (0.79)           Asp (0.80) iβ                                        Ser (0.79)           Lys (0.74)                                                Cys (0.77) iα  Ser (0.72)                                                Asn (0.73)           His (0.71)                                                Tyr (0.61) bα  Asn (0.65)                                                Pro (0.59)           Pro (0.62) bβ                                        Gly (0.53) Bα  Glu (0.26) Bβ                                        ______________________________________                                    

(The letter symbols are Hα, strong α former; hα, α former; Iα; weak α former; iα, α indifferent; bα, α breaker; and Bα strong a breaker. The β symbols are analogous. Trp is bβ if near the C-terminal of a β-sheet region.)

The alpha helix of a polypeptide chain comprises an average of 3.6 residues per turn. In globular proteins, the average length is about 17 Å, corresponding to 11 residues or 3 helix turns. In alpha and beta globin, the helices range in length from 7 to 21 amino acids (A.A.). The beta pleated sheet comprises 2.3 residues per turn; the average length is about 20 Å or 6 residues.

Chou and Fasman define an alpha helix nucleus as a hexapeptide containing four helix forming residues and not more than one helix breaker, and a beta sheet nucleus as a pentapeptide containing three beta sheet forming residues and not more than one sheet breaker.

The amino acid sequence in the vicinity of the di-alpha linker is as follows:

    __________________________________________________________________________     residue #                                                                            138 139 140 141       1   2   3   4                                      __________________________________________________________________________     AA    Ser Lys Tyr Arg --(XXX).sub.n --                                                                     Val Leu Ser Pro                                    Helix Not                                                                            H21 HC1 HC2 HC3       NA1 NA2 A1  A2                                     Helix Pot                                                                            079 107 061 079       114 134 079 059                                    Sheet Pot                                                                            072 074 129 090       165 122 072 062                                    __________________________________________________________________________      (Note: Helix and sheet forming potentials have been multiplied by 100 for      typographical reasons.)                                                  

The di-alpha linker is preferably only 1-3 amino acids. Thus, it can form an alpha helix only in conjunction with the linker "termini". A one or two residue linker, even if composed of amino acids with strong secondary structure propensities, would be unlikely to assume an alpha helix or beta sheet configuration in view of the disruptive effect of, e.g., Arg 141 or Ser 3. If the linker is 3 residues long, it would be preferable that no more than one residue be a strong alpha helix former, unless the linker also included a strong alpha helix breaker.

The amino acid sequence in the vicinity of the di-beta linker may impose more stringent constraints.

    ______________________________________                                         143  144    145    146          1    2    3    4                               ______________________________________                                         His  Lys    Tyr    His  --(XXX).sub.n --                                                                       Val  His  Leu  Thr                             H21  HC1    HC2    HC3          NA1  NA2  NA3  A1                              124  107    061    124          114  124  134  082                             071  074    129    071          165  071  122  120                             ______________________________________                                    

The di-beta linker is likely to be longer (preferably 1-9 A.A.) and therefore more susceptible to secondary structure formation. If secondary structure formation is not desired, it is desirable that the amino acid adjacent to Val-1 be an alpha helix breaker (e.g., Glycine) in view of alpha-helix propensities of Val-His-Leu. More generally, it is desirable that the linker not contain (or cooperate with the proximately linked amino acids to form) an alpha helix nucleus or beta sheet nucleus.

When secondary structure is not desired, amino acids with a high propensity toward alpha helix formation may be used in the linker if accompanied by "helix breaking" amino acids. Similarly, Beta sheet formation may be prevented by "sheet disrupting" amino acids.

Of course, prediction of secondary structure using Chou and Fasman's approach has its limitations and the ultimate test of the acceptability of a linker is whether or not the di-alpha or di-beta hemoglobin has the desired affinity for oxygen. In particular, a poly-alanine linker, despite its supposed propensity to alpha-helix formation, may well be of value since the alanine group is compact and therefore the linker should be quite flexible if secondary structure does not form.

In an especially preferred embodiment, di-alpha and beta globin genes are combined into a single polycistronic operon. The use of a polycistronic operon is not, however, necessary to practice the present invention, and the alpha (or di-alpha) and beta (or di-beta) globin genes may be expressed from separate promoters which may be the same or different.

While the preferred "genetically fused hemoglobin" of the present invention is one comprising a di-alpha and/or di-beta globin, other globin chains may be genetically fused and used in the production of hemoglobins of species other than Hgb A1 (α₂ β₂).

Pseudo-Octameric (Ditetrameric) Hemoglobin-Like Proteins with Disulfide Bridges

The ability to produce pseudotetrameric recombinant hemoglobins consisting of a single dialpha polypeptide and two beta chains (or a dibeta polypeptide and two alpha chains) provides a unique opportunity to create an asymmetric pseudotetramer from the normally symmetric pseudotetramer. Because the two alpha globin domains are expressed as a single polypeptide, it is possible to alter one of the alpha globin domains without altering the other. The result is a protein that, in its final folded state, contains two different alpha globin domains in a strict 1:1 ratio. This type of asymmetric hemoglobin molecule, with its unique chemical properties, cannot be easily constructed by any other method. A preferred embodiment of this invention would involve use of site-directed mutagenesis to substitute a cysteine residue in one of the two alpha globin domains of a di-alpha hemoglobin such as SGE1.1 (a di-alpha hemoglobin with a beta chain Presbyterian mutation) such that the cysteine would be on the surface of the folded recombinant hemoglobin molecule. A homogeneous preparation of pseudo-octameric hemoglobin could then be formed through interhemoglobin linkage of two pseudotetramers either directly by simple oxidation of purified pseudotetramers or by reaction with a bridging molecule (FIG. 27).

Advantages of a pseudo-octamer stem from several possible therapeutic features of this construct: (I) half-life in the bloodstream of the pseudo-octamer (MW⁻⁻ 128,000) is expected to be longer than the halflife of pseudotetrameric Hb (MW⁻⁻ 64,000). Studies of halflife as a function of macromolecular size indicate a correlation between increased size and increased circulatory halflife for chemically crosslinked Hb as well as other macromolecules. (II) because the number of oxygen binding heme groups per octamer is twice the number per tetramers, independent of size, the oncotic pressure for a given concentration of heme groups in a solution of octameric Hb is expected to be half that of an equimolar solution of heme contained in tetrameric Hb. Because of oncotic pressure effects, the maximum concentration of free tetrameric Hb that may be introduced into the blood stream is less on a per volume basis than the concentration of Hb normally carried in intact red blood cells. Reduction of oncotic pressure is therefore useful in increasing the per volume oxygen carrying capacity of a blood substitute. (III) although free hemoglobin purified from natural sources may be polymerized by chemical crosslinking to increase halflife via increased molecular weight, and to reduce oncotic pressure, all such preparations are heterogeneous. Not only does the genetic engineering approach provide a means of synthesizing a higher molecular weight complex without the need to involve chemical crosslinking and consequent additional purification steps, this approach also provides the first means of exerting strict control over the degree of polymerization. The ability to strictly control formation of pseudo-octamers will greatly facilitate characterization of the final product and will reduce the chance of adverse reaction to minor components.

Although direct formation of a disulfide bond between two SGE1.1 mon cys hemoglobins is desirable in order to avoid the need for chemical crosslinking, the halflife of such a bond in the bloodstream is unknown. Naturally occurring reducing agents may reduce the disulfide bond in vivo at a rate comparable to or greater than the halflife of the pseudo-octamer. The rate of reduction of the bond may be influenced by the location of the cysteine mutation on the surface of the hemoglobin. It may also be possible to insert two cysteine mutations in each hemoglobin in close enough proximity that steric hindrance would allow formation of a two-disulfide pseudo-octamer but prevent formation of higher molecular weight complexes.

Poly (Tetrameric) Hemoglobin With Other Intercysteine Linkages

It is also possible, of course, to couple two SGE1.1 mono cys molecules with a homobifunctional crosslinking reagent resulting in linkage via nonreduceable bonds. The degree of polymerization is still controlled by the use of the mono cys di-alpha or di-beta Hgb starting material. (Note that reference to SGE1.1 mono-cys in this section is for illustration only; other mono-cys di-alpha or di-beta Hbgs could be used in its place.)

By using bi-, tri-, tetra-, hexa-, or octa-functional crosslinkers several properties of multimeric hemoglobin which may contribute to longer serum half life can be controlled. The crosslinkers can be designed to give a nonreducible disulfide bond between two tetramers, to yield high molecular weight multimers of n>2 psuedo-tetramers (e.g. dodecamers, etc.) and/or to drop the overall isoelectric point of a hemoglobin octamer to further increase its half life.

Correlations of molecular weight with serum half life for proteins such as IL-2, demonstrate that a significantly longer half life may be expected as the molecular weight of a protein increases, particularly above the renal filtration limit of 50-70 kDa. SGE 1.1 mono-cys octameric hemoglobin will have a molecular weight of ca. 130 kDa, and this may double or triple its half life. However, a factor potentially limiting the half life of octameric hemoglobin formed by a disulfide link between two mono-cys SGE1.1's is reduction of the cys-cys disulfide bond by endogenous thiol-reducing agents found in the serum. Estimates of small molecule thiol levels in plasma vary from 17 μM to 5 μM. The major species is reduced glutathione. Other thiol compounds in plasma include cysteine, homocysteine, and gamma-glutamyl cysteine. Thus, small molecule plasma thiols are available for reduction of disulfide bonds. This may be reflected in the diminished half life seen with antibody-ricin A chains conjugates linked by regular disulfides (6.7 hrs) relative to conjugates linked with sterically hindered, and thus less reducible, alpha-methyl disulfides (42.5 hours).

Thus, in one embodiment, the octameric hemoglobin features a nonreducible disulfide crosslink such as a thioether bond or thiol-maleiimide adduct which may substantially extend the octamer half life. Simple homobifunctional crosslinkers or polyethylene glycol (peg) derivatives would likely be useful for this purpose (see below). The reaction of a bifunctional cysteine-specific crosslinker with a mono-cys di-alpha or di-beta Hgb should limit the products of the reaction to substantially a dumbbell-like octameric hemoglobin (FIG. 22) and unreacted hemoglobin. The reaction should be stoichiometric when the Hgb and crosslinker are present at high concentrations and the Hgb is present in a slight excess over the crosslinker maleiimides at pH 6.5-7.0. Further, there should not be substantial interference by reaction with SGE1.1 lysines. The preferential reactivity of the thiols to lysines can be roughly calculated as the product of their molar ratios and the ratio of the intrinsic reactivity of a maleiimide to thiols versus amines. This product is ca. 1 cys/40 lys!× 1000!=25 at pH 7. The side products would still be octamers, with one attachment site being a secondary amine and thus might well be functionally equivalent to the S-crosslinked octamers. Hydrolysis of the maleiimide adduct at pH 7 would be slow, and the ring opening would leave the crosslink intact.

The following are examples of homobifunctional crosslinkers that may form metabolically stable crosslinks between monocysteine pseudo tetramers:

1) 1,2-bis-(2-iodoethoxy)ethane

2) 4,4'-dimaleiimidylbenzene or N,N'-p-phenylenedimaleiimide

3) N,N'-bis-(3-maleiimido-propionyl)-2-hydroxy-1,3-propane diamine.

Longer half lives may also be obtained by increasing the apparent solution molecular weight by simply lengthening the distance between the two linked tetramers using a long crosslinking agent. The use of some potentially novel polyethylene glycol derivatives as homobifunctional crosslinkers, reacting with SGE1.1 mono-cys, may provide one mechanism for significantly increasing the molecular weight of octameric hemoglobin by virtue of the length of the crosslinker alone.

A suitable crosslinker for this purpose is

    maleiimido--CH.sub.2 CH.sub.2 C(═O)(OCH.sub.2 CH.sub.2).sub.n OC (═O) CH.sub.2 CH.sub.2 --maleiimido.

The length may be adjusted by variation of n. A few examples are given below.

    ______________________________________                                         Structure     Max Length                                                                               Source                                                 ______________________________________                                         n = 22         .sup.˜ 49Å                                                                    peg -1000                                              n = 76        .sup.˜ 166Å                                                                    peg -3350                                              n = 227       .sup.˜ 499Å                                                                    peg -10000                                             ______________________________________                                    

Homobifunctional N-hydroxysuccinimide-activated peg has been used previously to derivatize hemoglobin. Yabuki, et al., Transfusion, 30:516 (1990). This reaction resulted in a mixture of monomeric, dimeric, and trimeric species with an average stiochiometry of peg/hemoglobin of 6.2. However, 83% of the hemoglobin derivatized by peg was not crosslinked to another hemoglobin molecule. Control of the peg-derivitization of wild-type hemoglobin is not possible because there is no site-directed labeling of the hemoglobin starting material.

In contrast, the combination of SGE 1.1 mono-cys starting material and a peg crosslinker should yield a monodisperse dumbbell (pseudo-octameric) product only. The site-direction of the crosslinker attachment site should result in precise control of the apparent molecular weight which will depend on the size of the crosslinker. moreover, careful control of the site of the cys mutation on the surface of the recombinant hemoglobin should ensure that the functionality of the derivatized hemoglobin is maintained.

Higher Poly(Tetrameric) Hemoglobins

The above crosslinkers all involve the attachment of two hemoglobins at each end of a crosslinker. It may be advantageous to attach more than two hemoglobins to a single crosslinker to yield more oxygen-carrying capacity and to further increase the molecular weight. A hypothetical 4-hemoglobin coiled-coil linker complex is shown in FIG. 28(a). Design and synthesis of these coiled coil peptides has already been explored (for an example see Cohen and Perry, Proteins, 7:1-15 (1990)). The rationale for a coiled coil is that two intertwined alpha helices will be less sensitive to proteolytic cleavage than a single naked secondary structure like an extended peptide (rapidly cleaved by proteases), an alpha helix or a beta sheet.

Using molecular modeling, an internal disulfide may be designed in the center of a bi-functional coiled coil linker such that the strands are covalently attached. This should stabilize formation of the correct coiled coil crosslinker before mono-cys di-alpha or di-beta Hgb (e.g., sge1.1 cys) is attached. Additionally, a tri-functional crosslinker can be stabilized by use of a orthogonally-protected lysine (lys-FMOC) rather than a disulfide in the center of a proteolytically inert secondary structure. A polyproline helix can be used as the linker, and can be stabilized by branching the synthesis at the lys-FMOC after removal of the side chain. The three remaining lysines in the branched peptide would then be iodoacetylated to site-specifically attach a thiol-reactive group using either iodoacetic anhydride or N-succinimidyliodo-acetate and subsequently reacted with sge1.1-cys. An analogous tetra-functional crosslinker could be synthesized by inserting 1-2 prolines between two internal branching lysines to rotate them such that the two internal branching chains growing off the orthogonally protected lysines head in (nearly) opposite directions. Analogous structures could be made using D-glutamate(E) or D-aspartate(D) to provide protease resistance, and these would form an extended polyanionic chain at pH 7.

The sequence of a hypothetical alpha-helical coiled coil is modified from that given in Semchuck, et al., in Peptides: Chemistry, Structure and Biology; 566 (Rivier and Marshall, eds:1990), to leave only two lysines (K) at each end:

AcKCAELEGRLEALEGRLEALEGRLEALEGRLEALEGKL-amide (SEQ ID No:1)

This coiled coil should have about 10 turns of a helix and thus will be ca. 54 Å long, allowing two hemoglobins to attach on each side without steric interference. The exact sequence and length to allow appropriate placement of 4 sge1.1's would depend on the results of molecular modeling. Suggested trifunctional and tetrafunctional crosslinkers are diagrammed below. ##STR1##

Another possibility is an 8-hemoglobin complex (FIG. 28(b)). The rationale for considering this sort of complex is that it may be the way to obtain a very long half-life SGE1.1, due to the extreme stability of the "crosslinker" and the substantially higher molecular weight of the complex. The crosslinker might take the form of a doubly branched coiled coil, with a Lys(FMOC) replacing an Arg in the middle of the chain to allow the branching, and with a polyproline helix or other protease resistant secondary structure comprising the branching moiety. This structure could allow attachment of 6 SGE1.1's per crosslinker. Alternatively, a 4-helical bundle protein (See FIG. 28(c)) or 4-stranded coiled-coil such as those synthesized by DeGrado, Science, 243:622 (1989), with each helix in the 4-helical bundle containing the consensus sequence GELEELLKKLKELLKG (SEQ ID No: 2), the helices being linked by three PRR or RPR loops, could be utilized as a suitable core for the linker. This is one of the most stable proteins known, with a G=-22 kcal/mole separating the folded from the unfolded state. Each helix would be 4+ turns or ca. 22 Å long. Since this may not be enough room to fit two hemoglobins with one anchored at each end of the helix, they might have to be attached to different faces of the same helix, to Lysines placed at each end of the polar face of each helix. Each helix is amphipathic; this should allow relative freedom to have a total of 8 lysines (and no more) and to change the remaining lysines to arginines. At least two of the i, i+4 salt bridges per helix would be retained for stability of the protein. Attachment of SGE1.1-SH could be via iodoacetylation of the lysine epsilon amino groups and then reaction with SGE1.1-SH.

An example of a modification that might allow more room between hemoglobins would be addition of one turn of the helix to the N-terminus of the A and C helices and the C-terminus of the B and D helices. This and similar modifications would be subject to modeling and experimental constraints.

Analogous core proteins could be made as mutants of known 4-helical bundle proteins such as myohemerythrin or apoferritin, with the surface residues changed so that 8 (or more if topologically possible) lysines (2 per helix) exist on the surface for subsequent modification and attachment of SGE1.1

Poly(Tetrameric) Hemoglobins with Reduced Isoelectric Points

If the isoelectric point of the whole crosslinked conjugate also affects the serum half life, via electrostatic exclusion from the renal filter's "pore", additional negative charges could be included in the crosslink itself (rather than in the hemoglobin, which could change the function of the molecule) to drop the isoelectric point of the overall crosslinked particle. An additional benefit of this might be reduced uptake by the reticuloendothelial system, this uptake being a function of pI for cationized albumin.

We have preliminary evidence from succinylation of SGE1.1 which correlates the number of modified lysines with isoelectric point. This gives a rough estimate of the number of lys to glu and/or lys to asp mutations which may be necessary to reach a pI of 5 or less, the pI range which we expect we need to significantly extend half life. We believe that as many as 8 lysines may have to be modified (a total shift in charge of 16 units) to drop the pI roughly 2 units. It should be less disruptive of the functional properties of hemoglobin to do this via a peptide crosslinker rather than by mutation of the alpha and beta globin subunits proper. However, some mutations could be made in the crosslinker and the remainder in the subunits. As before, the SGE1.1-cys would be attached to iodoacetylated lysine epsilon amino groups by reaction at pH 6.5-7.0.

For human serum albumin in the rat, the half life varied roughly linearly with the pI of the protein, from ca. 4.6 hours for native albumin (pI=4) to 0.8 hrs at a pI above 9.5. Clearance was probably by multiple mechanisms, including potentially increased uptake into the reticuloendothelial system with increased pI. For rat trypsinogens, the difference in serum half life between versions with a pI of 5.0 (t_(1/2) of 4 min) was even larger. Thus a lower pI clearly appears to be an important variable in the serum half life of these proteins.

The following table gives examples of crosslinkers between SGE1.1-cys tetramers which should diminish the isoelectric point of the overall complex.

    ______________________________________                                         Source  Sequence                                                               ______________________________________                                         polyasp or polyglu                                                                      ##STR2##         n probably ≧ 10-12, x = D or E-               polyasp or polyglu                                                                      ##STR3##         n > 2 to provide flexibility at each terminus,                                 m ≧ 10-12, X = D.sup.-  or E.sup.-            polyasp or polyglu                                                                      ##STR4##         n ≧ 5-6, m ≧ 10-12, X = D- ior         ______________________________________                                                                   E-                                              

A number of the proposed crosslinkers could combine at least two, or possibly three of these attributes for potential additive effects.

It is possible that the unique amine groups in the peptide crosslinkers could be directly iodoacetylated during the peptide synthesis by treating iodoacetic acid as the last amino acid to be added, after deprotecting the lysine amine groups on the resin. In this case, the lysines would be orthogonally protected with N-FMOC or N-nitropyridinesulfenyl groups, or with BNPEOC. This could greatly simplify their synthesis.

Alternate methodologies to iodoacetylation as part of the synthesis could include the reaction of either sge1.1-SH or the peptide crosslinker with a heterobifunctional crosslinker specific for sulfhydryls and amines, such as sulfo-SMCC or similar reagents available from Pierce Chemical Co. (Rockford, Ill.).

Genetically Fused Poly(Tetrameric) Hemoglobins

Another approach to the preparation of multimeric (e.g., polytetrameric) hemoglobin involves the genetic fusing of individual tetramers utilizing other linkers. Two or more tetramers may be crosslinked, depending on the desired molecular weight and the efficiency of folding of the final molecule. The brine shrimp Artemia produces three polymeric hemoglobins with nine genetically fused globin subunits. Manning, et al., Nature, 348:653 (1990). These are formed by variable association of two different subunit types, a and bβ of the eight intersubunit linkers six are 12 residues long, one is 11 residues and one is of 14 residues. Similarly, the dialpha (or dibeta) subunits from different tetramers of a di-alpha or di-beta Hgb might be genetically fused together into an extended polypeptide which would link the individual pseudotetrameric domains.

Proteolytically stable extended polypeptide linkages can be envisioned. Desirable linker features might include 1) a number of glycines at each end to allow flexibility in entering the dialpha (or beta) terminal domains, and to decouple the linker secondary structure from that of the dialpha (or beta) terminal domains; 2) stiffness to separate tetramers, obtainable by an extended structure such as a polyproline helix or by polyglutamate or polyaspartate; and 3) inertness to proteases (vide supra or as in a collagen sequence). Several examples of such sequences are listed below. Obviously any other of the peptide linkers mentioned in this specification could be tried after first sterically modeling the fused-dialpha (or dibeta) termini environment. The links would go from the C-terminus of one dialpha to the N-terminus of the next and would be synthesized as a single gene. Besides modeling segments of protease-resistant or negatively charged secondary structure, one or more of the Artemia linkers should be modeled between tetramers. The beta chains could also be joined in this fashion, although the results of this on protein function would be unknown. It might be feasible to make an intermolecular di-beta (sge1.1) with or without additional intrachain crosslinkages.

    ______________________________________                                         Source    Sequence                                                             ______________________________________                                         polyproline helix                                                                        di α or β C term-(G)n-(P)n-(G)n-di α or β                N terminus                                                                     n probably ≧ 3, m probably ≧ 10-12                     polyaspartate or -                                                             (G)n-(D)n-(G)n- (should drop pI of complex)                                    glutamate                                                                      Artemia linker                                                                 (G)n-LRRQIDLEVTGL-(G)n-; n ≧ 0                                          (example) (SEQ ID NO:3).                                                       a helical coiled                                                               (G)n-KCAELEG(KLEALEG).sub.4 <-- not fused to                                   coil      terminus (SEQ ID NO:4).                                                        (should form octamer with coiled-coil crosslink)                     ______________________________________                                    

Two structures of human hemoglobin A₀ (either both in the oxy form or both in the deoxy form) taken or assembled from the Brookhaven Protein Data Bank were docked as close together as possible without van der Waals overlap between any residues, using the program Insight (Biosym. Inc., San Diego, Calif.). The distance from the alpha chain C terminal residue arg 141 to the amino terminal nitrogen of the alpha chain N terminal residue val 1 (in one structure) was then measured. This distance was ca. 22 Å when both molecules had the oxy structure and ca. 18 Å when both were in the deoxy structure. In the oxy and deoxy structures, the valine at the alpha chain N terminus is exposed at the side of a cleft in the structure, while the arg carboxylate is at the bottom of the cleft. Thus it is possible to genetically fuse these two termini without suffering a large structural displacement of residues around either terminal amino acid. Linkers which have sufficient length to stretch between these termini when the two hemoglobins are docked together (but without any van der Waals overlap) are listed below.

An alternative fusion may be envisioned between a truncated alpha chain in one hemoglobin and the N terminal alpha val 1 in the second hemoglobin. The first molecule could be truncated at ser 138, which intermolecular N terminal to C terminal distance is about 17 Å (deoxy) and 22 Å (oxy), and examples of genetically inserted linkers spanning this distance are listed below.

Thus two hemoglobin molecules could be linked (by fusing two intermolecular alpha domains) to generate a fusion protein approximately twice the size of normal human hemoglobin. An additional intramolecular crosslink, as introduced into rHb1.1 to prevent dissociation of hemoglobin into dimers, could be included as well, giving a fusion of four alpha domains.

We expect that the genetically inserted links will be stable in the presence of proteases, due to the steric occlusion by the two hemoglobins surrounding the linkage. This resistance may be further enhanced by the use of glycines, bonds between which may be less susceptible to proteases, since most proteases have side chain specificity for residues other than glycine (which has only a hydrogen as a sidechain, and thus may result in a poor Km of this substrate for a protease). A polyproline helix may also be used as a linker to enhance stability to proteases. Fusion of a polyglutamate or polyaspartate as a linker might allow a much lower isoelectric point for the complex, and thus a longer serum half life.

    ______________________________________                                                          end-to-end                                                                              con-                                                 linker           Distance formation                                                                               comments                                    ______________________________________                                         --(gly).sub.7 -- 25Å  extended minimal                                                                        length for                                                                     gly linker to                                                                  span termini                                                                   in both oxy                                                                    and deoxy                                                                      structures.                                                                    Longer                                                                         linkers (up                                                                    to 20-50                                                                       residues)                                                                      may also                                                                       work                                                                           favorably.                                  --(gly).sub.1-3 --(ala).sub.12 --(gly).sub.1-3 --                                               20Å-40Å                                                                         Ala in   the Gly are                                                           right    added for                                                             handed   flexibility                                                           alpha    and minimal                                                           helix    disturbance                                                                    of Hb                                                                          structure                                                                      around their                                                                   fusion with                                                                    the N and C                                                                    termini.                                                                       Length is                                                                      dependent on                                                                   the number                                                                     of glycines                                                                    and the                                                                        degree of                                                                      extension.                                  --(gly).sub.1-3 --(pro).sub.12-16 --(gly).sub.1-3 --                                            21-48Å                                                                              pro in a 12, 14, 16                                                            left     prolines.                                                             handed   Length                                                                poly-    dependent on                                                          proline  number of                                                             helix    prolines and                                                                   glycines                                    --(gly).sub.1-3 --(asp).sub.n --(gly.sub.1-3)--                                                 26-49Å        Asp residues                                                                   add negative                                                                   charges in                                                                     pseudo-                                                                        octamer.                                    ______________________________________                                    

Other residues could be substituted into these linkers while leaving their length essentially the same, including complete linkers taken from the sequence of other known human proteins such as hemoglobin, to prevent any recognition of the pseudooctamer as a foreign protein.

Use of linkers with a maximal length more than 18 Å and less than 22 Å may differentially stabilize the deoxy structure, and may result in a lowered oxygen affinity for the pseudooctamer.

Octameric Hemoglobins Formed Without Use of an Pseudooligomeric Globin

It is possible to produce an octameric hemoglobin, without substantial production of higher multimers, by suitable cysteine mutation of either the alpha or beta chain (see FIG. 35).

Hemoglobin mutants containing one X to cys mutation in the beta chain gene (giving two per tetramer) or in the alpha chain gene (also giving two per tetramer), in which the residues mutated to cysteine are both on or very close to the surface of the subunit and are as close (preferably 10-15 Å) to the dyad axis separating the subunits, may form octamers (two hemoglobins) linked by two disulfides. Polymerization of such mutants should be retarded by the proximity of the two disulfides to each other, such that after one disulfide is formed, a third incoming hemoglobin will be sterically hindered from reacting with either free cysteine on the two original hemoglobins.

Because it is possible that this mutant may form higher order polymers (rather than simply the octamer), a diluted solution may be used in vitro for formation of disulfide bonds. The kinetics of polymerization of hemoglobin should be at least second order (or a higher order) in hemoglobin concentration, while after one disulfide is formed, the formation of the second disulfide between two tetramers should be zero order in hemoglobin. Thus the ratio of polymerized product to octamer should diminish as the hemoglobin concentration is decreased. If formation of octamers is done under oxygenated conditions, the yield of octamers vs. polymers may increase further, since the distance between the two cys mutations is less in every case in the oxy hemoglobin structure than in the deoxy structure.

A list of preferred mutation sites in both the beta chain and the alpha chain is provided below:

Beta and alpha chain mutation sites for x to cys mutations to form disulfide-bond linked octameric hemoglobin.

    ______________________________________                                         Chain  mutation distance                                                                               distance,                                                                             Comment                                         ______________________________________                                         beta asn                                                                              80 to cys                                                                               22Å 18Å                                                                               no listed deleterious                                                          mutations, asn 80 is on                                                        surface.sup.a                                   beta asp                                                                              79 to cys                                                                               24Å 22Å                                                                               Hb Tampa.sup.a (asp to tyr) has                                                no major abormal property                                                      listed; Hb G-Hsi-Tsou (asp                                                     to gly) has increased O.sub.2                                                  affinity; is on surface                         alpha asn                                                                             78 to cys                                                                               24Å 20Å                                                                               on surface; no major.sup.a                                                     abnormal properties of                                                         known mutations of asn 78                       alpha asp                                                                             75 to cys                                                                               22Å 18Å                                                                               on surface; no major.sup.a                                                     abnormal properties of                                                         known mutations of asp 75                       alpha asp                                                                             74 to cys                                                                               26Å 20Å                                                                               on surface; no major.sup.a                                                     abnormal properties of                                                         known mutations of asp 74                       ______________________________________                                          .sup.a R. N. Wrightstone. Policies of the International Hemoglobin             Information Center (IHIC), Comprehensive Sickle Cell Center, Medical           College of Georgia. 1988.                                                

Miscellaneous

The appended claims are hereby incorporated by reference as a further enumeration of the preferred embodiments. All cited references are incorporated by reference to the extent necessary to enable the practice of the invention as now or hereafter claimed.

By the means suggested in this specification, we have had the following achievements:

(1) Expression of Met-FX-alpha globin in E. coli from a dicistronic operon comprising a introductory cistron and a Met-FX-alpha globin gene, both transcribed from a Tac promoter. (Example 2)

(2) Expression of Met-FX beta globin in E. coli by similar means. (Example 2)

(3) co-expression of Met-FX-alpha globin and Met-FX-beta globin in E. coli from a tetracistronic operon comprising an introductory cistron, an FX-alpha globin gene, an intercistronic sequence, a second introductory cistron, and an FX-beta globin gene, all controlled by a single Tac promoter. The alpha and beta globins were intracellularly assembled into functional FX hemoglobin. The FX Hgb was enzymatically converted to Hgb. (Example 3)

(4) Co-expression as in (3) above of mutants of FX-hemoglobin in which the beta globin subunits possessed the Beth Israel, Cheverly, Providence/MSR, Kansas or beta⁶⁷ Val→Ile mutations. (Example 4)

(5) Co-expression as in (3) above of Met-alpha globin and Met-beta globin and intracellular assembly into Met-Hgb (a.k.a. Des-FX-Hgb). (Example 5)

(6) Co-expression as in (3) above of Des-Val-alpha globin and Des-Val-beta globin. (Example 6)

(7) Co-expression as in (6) above, but with the Des-Val-beta globin gene preceeding the Des-Val-alpha globin gene within the operon. (Example 6)

(8) Co-expression as in (3) above of (Des-Val)-alpha-(Gly-Gly)-alpha globin and beta-globin and intracellular assembly into a di-alpha Hgb. (Example 8)

(9) Co-expression as in (8) above of (Des-Val-alpha-(GlyGly)-alpha globin and a mutant (Nagai, Arg-Nagai, or Kansas) of beta globin and intracellular assembly into a mutant di-alpha Hgb. (Example 9)

(10) Co-expression as in (3) above of Des-Val-alpha globin and a Des-Val beta-globin with the Presbyterian mutation and intracellular assembly into a mutant Des-Val Hgb. (Example 11) .

(11) Co-expression as in (8) above of a (Des-Val)-alpha-(GlyGly)-alpha globin and of a beta1 globin with the Presbyterian mutation. (Example 11).

(12) Expression of di-alpha globin and beta-globin from separate promoters on the same plasmid (Example 12).

(13) Co-expression of di-alpha globin and two copies of beta globin from same operon (Example 13).

(14) Expression of Di-beta Hgb (Example 14).

(15) Devised Hypothetical Protocol for co-expression of alpha globin and beta-globin under control of separate promoters on the same plasmid (Example 16).

(16) Devised Hypothetical Protocol for co-expression of alpha and beta-globin under control of different promoters on different plasmids (Example 17).

(17) Co-expression of a and 3 globin in S. cerevisiae (Example 19).

(18) Co-expression of di-α globin and 3-globin in S. cerevisiae. (Example 23).

(19) Construction of Des-Val-alpha globin and Des-Val-beta globin in E. coli under lambda PL control.

(20) Co-Expression of di-alpha globin and Des-Val-beta globin in E. coli under lambda PL control.

(21) Co-Expression of alpha and beta globin from separate plasmids in diploid strains of S. cerevisiae.

(22) Co-Expression of di-alpha globin and beta globin Presbyterian mutant in S. cerevisiae.

(23) Preparation of vectors for expression of di-alpha globins with -Gly- or -Pro- linkers.

(24) Evaluation of different strains and induction temperatures for expression of di-alpha Hgb in E. coli.

(25) Co-Expression of alpha and mutant beta globin in S. cerevisiae, assembling to form low-affinity Hgb mutants.

(26) Construction of vectors for expression of monocysteine (asymmetric) mutants of di-alpha and di-beta hemoglobin.

(27) Hypothetical protocol for expression of a disulfide-bonded pseudooctamer.

An unexpected and surprising change in oxygen binding characteristics of hemoglobin was observed upon replacement of the N-terminal valine with methionine. As illustrated in Example 11, hemoglobin A₀ purified from blood has a P₅₀ value of 4.3 with N=2.8 when measured at 25° C. DesFX-hgb produced in E. coli, a hemoglobin identical to A₀ except for the addition of a methionine at the N-termini of the alpha and beta chains, has essentially the same P₅₀ and N values. (Within experimental error, Example 7). Thus, the addition of a methionine, without altering the adjacent valine residue, has little or no effect on oxygen binding. On the other hand, a higher P₅₀ value, 6.6, was observed for desVal-hgb produced in E. coli, a hemoglobin in which the normal N-terminal valine of each chain was replaced with methionine. Cooperativity, as measured by N, was virtually the same, however, for all three molecules.

A similar comparison was made for two hemoglobins each containing the Presbyterian mutation, one produced in E. coli (Example 11) and one in yeast. The E. coli hemoglobin was constructed with a Des-Val alpha chain, i.e., the N-terminus had the normal valine replaced with methionine. Oxygen binding was characterized by P₅₀₌₁₉.8, N=2.5 at 25° C. and by P₅₀₌₃₄.5 and N=2.5 at 37° C. (Example 11). The corresponding yeast coding region begins with an additional methionine codon in front of the normal valine codon. Because this initial methionine is removed post translationally in vivo, the purified hemoglobin has a normal N-terminal valine. For this molecule, P₅₀₌₂₃ to 25 and N=2.5 when measured at 37° C. Thus, in the above instances, the replacement of an N-terminal valine with an N-terminal methionine increased the P₅₀ value. Under physiological conditions, it is expected that the genetically fused Presbyterian hemoglobin produced in E. coli will deliver 20-30% more oxygen than the similar hemoglobin, with its altered N-terminus, produced in yeast.

A very large number of different plasmids are referred to in the Examples which follow. In order to highlight the relationships among these plasmids, a Table of vectors has been compiled (See Table 200).

EXAMPLE 1 Construction of FX-alpha Globin (pDLII-62m) and BetaGlobin (pDLII-10a) Expression Vectors

1.1 Materials and Methods

Unless otherwise stated all electroelutions, phenol/chloroform extractions, ethanol (EtOH) precipitations, alkaline-SDS plasmid purifications, agarose electrophoresis and DNA manipulations were carried out essentially as described by Maniatis et al. ("Molecular Cloning" Cold Spring Harbor, N.Y., 1982).

The following abbreviations and definitions are used: ethylenediaminetetraacetic acid (EDTA); sodium dodecylsulfate (SDS); polyacrylamide gel electrophoresis (PAGE); dimethylsulfoxide (DMSO); dithiothreitol (DTT); isopropyl-beta-D-thiogalactopyranoside (IPTG); 2 xYT medium (16 g bacto tryptone, log Bacto yeast extract, 5 g NaCl per liter water); SDS-PAGE loading buffer (0.125M Tris-HCl, pH 6.8, 20% V/v glycerol, 2% SDS, 2% 2-mercaptoethanol, 0.01% bromphenol blue); phosphate buffered TB medium (24 g yeast extract, 12 g tryptone, 4 mL glycerol, 17 mL 1M KH2PO₄, 72 mL 1M K₂ HPO₄ per liter water); Tris-EDTA buffer (TE: 10 mM Tris, pH 8.0, 1 mM EDTA); Tris-borate, EDTA buffer (TBE: 0.089M Tris, 0.089M boric acid, 0.002M EDTA); Tris-acetate EDTA buffer (TAE: 0.04M Tris, 0.04M acetic acid, 0.001M EDTA).

Protein electrophoresis was performed by the method of Laemmli, U.K. (Nature 1970, 227, 680-685) on 15% SDS-polyacrylamide gels.

E. coli JM109 cells were made competent as follows: Two hundred milliliters of 2xYT medium was inoculated with 2 mL of an E. coli JM109 overnight culture. The 200 mL culture was then incubated with shaking at 37° C. for 1 hour. Cell were harvested by centrifugation at 6000 rpm at 4° C. for 4 min in a Beckman JS13.1 swinging bucket rotor (Beckman Instruments, Inc., Palo Alto, Calif.). The cells were resuspended in 50 mL total of a buffer containing 45 mM MnC12, 60 mM CaCl₂, 40 mM KOAc, pH 6.2, 15% sucrose (w/v) 1.3% RbCl (w/v), and 7.5% (v/v) glycerol. Following centrifugation as above, the cells were resuspended in 20 mL of the same buffer and incubated at 0° C. for 30 minutes. Cells were dispensed in one milliliter aliquots and stored at -80° C. until used.

Unless otherwise stated, all restriction enzyme digests were performed under conditions suggested by the manufacturer. DNA concentrations were determined by Beer's law using measured absorbance at 260 nm (A₂₆₀) against a water reference standard. For synthetic oligonucleotides the following extinctions were used: E²⁶⁰ =0.05 mL/ug/cm and for double stranded DNA E²⁶⁰ =0.02 mL/ug/cm.

1.2 Globin Gene Synthesis

Each globin gene was constructed from 14 separate oligonucleotides ranging in length from 50-85 base pairs. The FX-alpha globin gene was synthesized from oligonucleotides SJH I-33α-f, SJH I-34α-f and SJH I-35a,b; FX-beta globin gene was synthesized from SJH I-36α-f, SJH I-37α-f and SJH I-38a,b. Each globin gene is preceeded by a short loader gene as previously described. oligonucleotides were synthesized on a Biosearch 8600 instrument using beta-cyanoethylphosphoramidite chemistry on 1,000 angstrom CPG columns (0.2 mmole) (Beaucage, S. L. and Caruthers, M. H. Tet. Lett. 1981, 22, 1859-1862). The sequence of these oligonucleotides is given in FIG. 4.

Unless otherwise stated, the oligonucleotides were cleaved from the columns using the following protocol: Approximately 0.5 mL of fresh, concentrated NH₄ OH was drawn into the column with a 1 mL syringe. The NH₄ OH was allowed to react for 20min then expressed into a glass vial (approximately 4 mL capacity). This process was repeated 2 times, the vial was filled to greater than 75% capacity with NH₄ OH, and heated at 55° C., overnight. The samples were lyophilized, resuspended in 0.1 mL H₂ O and the concentration estimated by measuring A₂₆₀.

Two hundred micrograms of the individual oligonucleotides were purified by urea polyacrylamide gel electrophoresis. To do this, an equal volume of 2× loading buffer (90% formamide (v/v), 0.5xTBE, 0.05% (w/v) bromophenol blue, 0.05% (w/v) xylene cyanol) was added to the oligonucleotide. The sample was heated at 95° C. for 10 min. and applied to a 10% acrylamide gel containing 7M urea and 1XTBE. Electrophoresis was at 800 volts for approximately 2 hrs. The full length oligonucleotide was visualized under ultraviolet light. That region of the gel was then excised and incubated in 3 mL of 100 mM Tris, pH 7.8, 500 mM NaCl, 5 mM EDTA buffer at 60° C., overnight.

The oligonucleotide solution was further purified by reverse phase chromatography as follows: A C18 Sep-Pak cartridge (Waters Associates) was washed with 10 mL 100% methanol followed by 10 mL of H₂ O. The oligonucleotide solution was applied to the column, washed with 20 mL H₂ O and eluted with 3×1 mL aliquots of 50 mM triethylammonium acetate, pH 7.3/methanol (1:1). The purified oligonucleotide was lyophilized, washed with 100% ethanol, dried, and resuspended in 0.1 mL H₂ O. The concentration was determined by A₂₆₀.

The synthetic FX-beta gene sequence (included in FIG. 5) was constructed as follows: 100 pmole of the following oligo nucleotides were kinased in 3 separate reactions. Reaction 1 contained oligonucleotides SJH I-36b, c, d, e, and f. Reaction 2 contained SJH I-37a, b, c, and e. Reaction 3 contained SJH I-37d, f, and SJH I-38a. After combining the appropriate oligonucleotides, the solutions were lyophilized to dryness and resuspended in 16 uL of H₂ O. Two uL of 10× kinase buffer (0.5M Tris-HCl, pH 7.4, 0.1M MgCl₂), 0.5 uL of 100 mM DTT, and 1 uL of 1.0 mM ATP were then added. The reaction was initiated by addition of 1 uL(2 U) of T4 polynucleotide kinase (IBI, Inc., New Haven, Conn.). After incubation at 37° C. for 1 hour, the reactions were heated to 95° C. for 10 minutes to inactivate the kinase. The three reactions were combined and 100 pmoles of oligonucleotides SJH I-36a and SJH I-38b were added. After addition of 10 uL of 100 mm Tris, pH 7.8, 100 mM MgCl₂, the oligonucleotides were allowed to anneal by incubating at 65° C. for 30 min, 37° C. for 30min, and 15° C. for 1 hour. Annealed oligonucleotides were ligated by addition of ATP (1 mM, final) and DTT (10 mM final) and 4 uL (20 U) T4 DNA ligase (IBI, Inc., New Haven, Conn.) and incubation at 15° C. for 1 hour. Aliquots of this ligation mixture were then cloned directly into M13mp19 (see below).

Oligonucleotides for the construction of FX-alpha globin were similarly purified, kinased, annealed, and ligated. Before ligation into M13mp19, the full length FX-alpha globin gene was purified by electrophoresis through 0.8% agarose in 1XTAE buffer and electroeluted into dialysis tubing using 0.5× TBE as the electroelution buffer by the method of Maniatis et al. Eluted DNA was phenol extracted, EtOH precipitated, and resuspended in 20 uL TE buffer. Aliquots were used for cloning into M13mp19.

1.3 Phage Vectors

For cloning, a 2, 5 or 10 fold molar excess of the individual FX-alpha and FX-beta globin gene sequences were combined with 200 ng of double cut, gel purified M13mp19-RF (New England BioLabs, Inc., Beverly, Md.) and ligated overnight at 15° C. in 50 uL ligation buffer (IBI, Inc., New Haven, Conn.) containing 2 U of T4 ligase. FX-Alpha globin was cloned into the Xmal/PstI sites of M13mp19. FX-Beta globin was cloned into the PstI/HindIII sites of M13mp19.

E. coli JM109 was transformed with the M13mp19 ligation mixture containing the FX-alpha or FX-beta globin gene sequences using the following transformation protocol: Nine microliters of DMSO was added to 0.25 mL of competent E. coli JM109 and the cells were incubated on ice for 10 min. Aliquots of the FX-alpha or FX-beta globin ligation reactions were added and incubated on ice for 40 minutes and at 42° C. for 3 min. One hundred microliters of a JM109 overnight culture was added to each transformation mix followed by 60 μL of a solution containing 50 μL of 2% (w/v) 5-bromo-4-chloro-3-indolyl galactopyranoside in dimethylformamide and 10 μL of 100 mM isopropylthiogalactoside. Molten B-top agar (10 g Bacto tryptone, 8 g NaCl, 6 g agar per liter), 2.5 mL was added and the mixture poured onto a B-bottom agar plate (10 g Bacto tryptone, 8 g NaCl, 12 g of agar per liter). Following incubation overnight at 37° C. colorless plaques (i.e., clones containing inserted DNA) were removed from the plates using sterile transfer pipettes and inoculated into 1 mL of a JM109 overnight culture diluted 1:100 in 2xYT media.

Bacteriophage clones were grown at 37° C. for 6 to 8 hrs. and centrifuged in a microcentrifuge for 5 minutes. The cell pellets were processed for M13mp19 RF using the alkaline-SDS method and resuspended in 20μL of TE buffer containing 20μg/mL DNAase free RNAase (Sigma, St. Louis, Mo.). Aliquots (1-3 μL) of the RF preparations were digested with 10-20 units each of the appropriate restriction enzymes (see above) and analyzed on 0.8% agarose electrophoresis gels (see above).

M13mp19 clones containing the correct size inserts were grown in larger quantities to obtain single stranded phage DNA for sequencing. Thirty-five milliliters of 2xYT medium was inoculated with 0.3 mL of a JM109 overnight culture. After growth for 1 hour at 37° C., the 35 mL culture was inoculated with 200 μL of the appropriate phage-containing supernatant. The culture was incubated for 6 hours at 37° C. and the culture supernatant collected by centrifugation at 9000 rpm for 10 min in a JS13.1 rotor. The phage were precipitated from 31 mL of supernatant by addition of 5 mL 4M NaCl and 4 mL 40% (w/w) polyethylene glycol 6000 and recentrifuged as above. The phage pellet was resuspended in 0.4 mL of TE, extracted with phenol/chloroform/isoamyl alcohol (50:49:1 (v/v)) three times, chloroform/isoamyl alcohol (49:1 (v/v)) one time, and ethanol precipitated. The DNA pellet was resuspended in 20 μL TE and quantitated spectrophotometrically by A₂₆₀. One microgram of phage DNA was used per set of sequencing reactions.

Sequencing was by the dideoxy method of Sanger (Sanger, F. S. et al. Proc. Nat. Acad. Sci. USA 1977, 74, 5463-5467) with M13 -20 and M13 -40 universal primers (New England BioLabs, Beverly, Mass.) and gene specific primers (for FX-alpha globin the following primers were used: alpha-1 5'-CGTATGTTCCTGTCTTT-3'; alpha-2 5'-ACAAACTGCGTGTTGAT-3'; for FX-beta globin the following primers were used: beta-1 5'-GCTGGTTGTTTACCCGT-3'; beta-2 5'-ACCCGGAAAACTTCCGTC-3').

1.4 Expression Vectors pDL II-62m and PDL II-10a

The appropriate sequences were excised from the M13mp19 vector and cloned into the pKK-223-3 (Pharmacia/LKB, Piscataway, N.J.)) expression vector under control of the Tac promoter (Brosius, J. and Holy, A. Proc. Nat. Acad. Sci., USA 1984, 81, 6929-6933). A DNA sequence encoding alpha globin was removed by cutting with EcoRI and PstI. The globin containing fragment was gel purified and ligated into EcoRI and PstI double cut, gel purified pKK-223-3 using methods described above. E .coli JM109 cells were transformed with the ligation reaction containing the desired FX-alpha globin sequence (pDL II-62m) (FIG. 1) and selected for by growth on 2xYT media containing ampicillin (100 g/mL). Individual clones were screened for the presence of the desired insert (yielding pDL II-62m) by alkaline-SDS purification of plasmids and restriction analysis with the enzymes EcoRI and PstI using the methods described above. Cloning of the FX-beta globin sequence into the pKK-223-3 expression vector, yielding pDL II-10a (FIG. 1) was done analogously except that restriction analysis and cloning was done with PstI and HindIII.

EXAMPLE 2 Separate Expression of Synthetic FX-Alpha and FX-Beta Globin

To assess the expression of the individual FX-globin gene products, E. coli JM109 clones transformed with either pDL II-62m or pDL II-10a were inoculated into 2 mL of TB media containing ampicillin (100 μg/mL). The inoculum was grown at 37° C. for 3-4 hours, then divided into two 1 mL aliquots. One of the aliquots was induced by the addition of IPTG (1 mM, final) and grown for an additional 3-4 hours. The cells were collected by centrifugation, resuspended in 0.5 mL SDS-PAGE loading buffer and heated at 85° C. for 10 minutes. The total cell protein mixture was electrophoresed on 15% SDS-polyacrylamide gels using authentic hemoglobin as a molecular weight standard.

EXAMPLE 3 Polycistronic Coexpression of FX-alpha and FX-beta Globin Gene Products from the Same Operon (PDLII-66a) And Conversion of FX-Hemoglobin to Hemoglobin

To achieve coexpression of FX-alpha and FX-beta globins from a single polycistronic operon, the FX-beta globin sequence from pDL II-10a was excised with HindIII and PstI, gel purified, and ligated into PstI/HindIII cut and gel purified pDL II-62m. Ligation and transformation conditions were identical to those described above. Note that each globin cistron was preceeded by an "introduction" cistron as previously described, so that the entire Tac promoter driven operon had four cistrons. Clones were individually examined for the presence of both FX-alpha and FX-beta globin genes by digestion of plasmids with EcoRI and separation of fragments by electrophoresis through 0.8% agarose. Plasmids containing both genes (pDL II-66a, FIG. 1) produced a fragment of approximately 1.0 kb following EcoRI digestion.

Clones containing pDL II-66a were grown in 2 mL of TB mediα containing ampicillin (100 μg/mL) for 4 hours at 37°. The culture was divided into two 1 mL aliquots, one of which was induced with 1 mM IPTG. Incubation was continued for 4 hours. Total cell protein extracts for both the uninduced and induced clones were examined by SDS-PAGE electrophoresis to confirm the coexpression of both FX-alpha and FX-beta globin.

To determine if the coexpression of both gene products resulted in the formation of tetrameric FX-alpha globin₂ FX-beta globin₂ protein, the following experiments were performed. Two liters of TB medium containing ampicillin (100 μg/ml) was inoculated with 20 mL of an overnight culture of an FX-alpha/FX-beta expressing E. coli clone and grown to an optical density at 600 nm (OD₆₀₀) of 2.1 at 37° C. The culture was induced with IPTG (2.5 mM final concentration) and grown to an OD₆₀₀ of 3.5.

The cells (40 gm) were collected by centrifugation at 10,000× g and suspended in 80 mL of lysis buffer (50 mM Tris-HCl, pH 8.0, 25% sucrose, 1 mM EDTA). Ten milliliters of lysozyme solution (18 mg/ml in lysis buffer) was added and the mixture incubated on ice for 30 min. MgCl₂, MnCl₂, and DNAse I (Sigma, St. Louis, Mo.) were added to final concentrations of 10 mM, 1 mM and 10 μg/mL, respectively. The cells were incubated at room temperature for 1 hour and an equal volume of a solution containing 1% one percent deoxycholic acid, 1% Nonidet P-40, 20 mM Tris-HCl pH 7.5, 2 mM EDTA was added to the lysate.

Particulate material was removed by centrifugation at 10,000× g for 10 min. The supernatant (-200 mL) was bubbled with carbon monoxide for 5 min and dialyzed overnight against 4 liters of 10 mM NaPO₄ buffer, pH 6.0. The cell-free extract was clarified by centrifugation at 10,000× g for 10 min. To the supernatant was added 20 g DE-52 (Whatman. U.K.). The pH of the suspension was adjusted to 7.0 and the ion exchange resin was removed by centrifugation. The pH of the supernatant was readjusted to 6.0 and the supernantant was loaded onto a CM-cellulose column (2.5×15 cm) equilibrated in 10 mM NaPO₄, pH 6.0 at 4° C. The column was washed with two bed volumes of 10 mM NaPO₄, pH 6.0 followed by a linear gradient of 10 mM NaPO₄ l pH 6.9 to 20 mM NaPO₄ l pH 9.0 (400 mL total volume). Fractions 36 to 42 contained a red solution and were combined; an aliquot of this solution was scanned from 650 nm to 400 nm revealing a spectrum identical to that for carboxyhemoglobin (FIG. 6). An aliquot of the same peak was analyzed by SDS-PAGE electrophoresis with hemoglobin as molecular weight standard and was found to contain two protein bands of approximate MW 15,500 and 16,200. As expected, these bands migrated at a slightly slower rate than authentic hemoglobin (alpha MW=15,100; beta MW=15,850) presumably due to the five amino acid extension of the Factor X recognition sequence. On this basis the material was designated FX-hemoglobin.

FX-Hemoglobin (2.0 mg) was digested at room temperature for 2 hours in 3 mL of 20 mM HEPES, pH 7.4, 0.1 M NaCl, 10 mM CaCl₂ containing 2mg of trypsin (Sigma, St. Louis, Mo., 10,000 units/mg). SDS-/PAGE electrophoresis confirmed the conversion of FX-hemoglobin to material that comigrates with native hemoglobin.

EXAMPLE 3A Distribution of FX-Globin Products from E. coli Expression

One hundred milliliter cultures of E. coli clones expressing Fx-alpha globin (plasmid pDL II-62m), Fx-beta globin (plasmid pDL II-10a) and Fx-hemoglobin (plasmid pDL II-66a) were started with 1 ml inocula of overnight cultures. After growth for 4 hours, protein expression was induced by addition of IPTG to 1 mM final concentration. Incubation was continued for an additional 3 hours. The cells were collected by centrifugation, weighed, and lysed as described above except that DNAase treatment was done for 30 minutes on ice. The samples were centrifuged at 5000× g, 10 min and the supernatants (representing the soluble protein fraction) brought to 2 ml final volume with H₂ O and frozen at -80° C. The pellets (representing the insoluble protein or inclusion body fraction) were washed twice in 5 mL 0.5% Triton X-100, 1 mM EDTA, resuspended in 2 ml H₂ O and frozen at -80° C.

Analysis of protein distribution was accomplished by SDS-PAGE and Western blotting. The primary antibody was rabbit anti-human hemoglobin IgG. The western blotting protocol was according to the manufacturer's recommendations (Proto Blot Western Blot AP system, Promega Corp., Madison, Wis.). Samples of soluble and insoluble protein representing 60 μg of wet cell weight were analyzed from the FX-alpha and FX-beta expressing clones. Due to the greater level of expression in the FX-hemoglobin clone, material representing only 15 μg of wet cell weight was analyzed.

As seen in Table 11 the distribution of the proteins varied. Fx-alpha globin was detectable only in the soluble fraction while FX-beta globin partitioned between the insoluble and soluble fractions of the cell. FX-Hgb, which stabilizes each separate subunit, was found only in the soluble fraction, and at a concentration at least 2.6 times that of the individually expressed subunits. These results indicate that FX-beta globin is totally soluble only when allowed to assemble with FX-alpha globin.

EXAMPLE 4 Construction And Expression of Mutant FX-Alpha/FX-Beta Tetrameric Hemoglobin Expression Vector

Hemoglobin Beth Israel: pDL II-10 a was digested with restriction enzymes SacI and SpeI, gel purified, and isolated by electro-elution. Oligonucleotides incorporating the appropriate codon change for Hemoglobin Beth Israel (beta¹⁰² asn→ser) (FIG. 7) were synthesized as previously described above to bridge the SacI to SpeI restriction sites. Purification and quantitation of the individual oligonucleotides was as previously described. The complementary oligonucleotides were annealed by heating to 95° C. for 10 min. followed by slow cooling to room temperature over a 2 hour period. An aliquot of the annealed mixture was then combined with the SacI/SpeI digested, gel purified pDL II-10a plasmid at molar ratios similar to those used for the initial FX-alpha and FX-beta cloning. T4 DNA ligase (2 U) was added and the mixture incubated for 1 hour at room temperature. E. coli JM109 was transformed with this ligation mixture as previously described. Individual clones were isolated, and plasmids purified and sequenced using primer beta-l (vida supra). Plasmid sequencing was done with a Sequenase kit (United States Biochemical Corp., Cleveland, Ohio) following the protocol supplied by the manufacturer. The appropriate mutated beta globin sequence was then excised with HindIII and PstI, gel purified, and cloned into pDL II-62m as described above.

Other hemoglobin mutants: The synthetic genes encoding Hemoglobin Cheverly (beta⁴⁵ phe→ser) Hemoglobin Providence/MSR (beta⁸² lys␣asp) and Hemoglobin beta⁶⁷ val→ile and Hemoglobin Kansas (beta ¹⁰² asn→thr) were prepared similarly except with synthetic oligonucleotides spanning the SacII→BglII, SalI→SpeI, NcoI→KpnI and SacI→SpeI restriction sites respectively (FIG. 7). Synthesis of the mutant oligonucleotides, restriction enzyme digestion, gel purification, and ligation conditions were identical to those used for Hemoglobin Beth Israel. All mutations were first cloned into plasmid pDL II-10a, appropriate clones were sequenced, and the mutated beta globin gene was subcloned into PstI and HindIII digested pDL II-66a. Plasmid sequencing was accomplished as described previously. E. coli cells were transformed, cultured, and induced as previously described. FX-hemoglobin mutants were purified by the method of Example 3. Oxygen binding of purified hemoglobin mutants is shown in Table 9.

EXAMPLE 5 Production of Synthetic Met-Alpha/Met-Beta Hemoglobin

5.1 Construction of Des-FX Alpha and Des-FX Beta Globin Genes

We established that E. coli could produce tetrameric fusion-hemoglobin. Elimination of the DNA sequences coding for the Factor Xa substrate recognition site on the N-terminal end of each peptide should result in production of synthetic (FX free; "des-FX") hemoglobin containing the N-terminal methionine as its only extra residue. pDL II-62m was digested with EcoRI and PstI to excise the sequence containing the FX-alpha globin gene. The FX-alpha globin gene was then gel purified. pGEM-1 (Promega Corp.) was linearized with EcoRI and PstI, gel purified, and the FX-alpha globin gene ligated into the plasmid as described previously. Clones (pGEM-FX-A) containing the FX-alpha globin gene in pGEM-1 were identified by digestion of purified plasmids with EcoRI and PstI followed by agarose gel electrophoretic analysis. FX-Alpha pGEM was digested with NdeI and EagI to remove the Factor X_(a) coding sequence (FIG. 5). oligonucleotides containing the DNA sequence encoding native alpha globin were synthesized with ends compatible to NdeI and EagI (FIG. 8) restriction sites. After synthesis, the oligoncleotides were purified, annealed, and ligated as described above. The sequence of pGEM-desFX-alpha (pDL II-83a) was confirmed by dideoxy-sequencing of the plasmid using the T₇ promoter primer (Promega Corp., Madison, Wis.). A clone containing the correct sequence was then digested with EcoRI and PstI. The des-FX alpha globin gene was gel purified, cloned into EcoRI/PstI digested, gel purified pKK-223-3 to generate pDL II-86c. E. coli strain JM109 was transformed with the ligation mixtures and expression of des-FX alpha globin by individual clones was determined by analysis of total cell protein extracts of induced culture inocula (see above). Des-FX alpha globin, in contrast to FX-alpha globin, co-migrates with authentic alpha globin on SDS-PAGE.

The des-FX beta globin sequence was prepared in an analogous fashion using the FX-beta globin gene excised from pDL II-10a with PstI and HindIII. This gel purified beta globin sequence was then ligated into pGEM-1 that had been digested with the same two enzymes. A pGEM-1 plasmid containing the FX-beta globin gene was digested with NdeI and SacII, gel purified, and used for construction of the des-FX beta globin gene. The oligonucleotides conferring the desired sequence (FIG. 20) were synthesized, purified, annealed, and ligated into NdeI, SacII cut pGEM FX-beta to form pGEM-des-FX beta (pDL III-6f) as described above. After confirming that the sequence was correct the des-FX beta gene was removed with PstI and HindIII and gel purified. Des-FX beta was then ligated into the des-FX alpha-containing plasmid pDL II-86c using the PstI/HindIII sites. Clones containing both des-FX alpha and beta globin genes (pDL III-13e) (FIG. 9) were confirmed by EcoRI digestion of purified plasmids (see above), and screened for expression by comparison of IPTG induced and non-induced cultures. Des-FX hemoglobin co-migrates with native hemoglobin on SDS-PAGE.

5.2 Characterization of Met-Hgb (Des-FX Hgb)

Recombinant methionyl-Hgb has different reactivities than Hgb Ao in the presence of chloride and phosphate ions (Table 10) and with changes in hydrogen ion concentration (FIG. 10). The reason for this is thought to be the additional amino acid, methionine, on the N-termini of both globins. The N-terminal amino group of the alpha chain is important for the change in P₅₀ by phosphate ion. Displacement of the N-terminal amino group in space or changing its electronic state by the addition of the methionine can alter these effects.

The increase in P₅₀ seen in the presence of inositol hexaphosphate ion is irrelevant for any Hgb found in solution because the concentration of monophosphate ion found in plasma is not enough to significantly increase the P₅₀. There is no inositol hexaphosphate ion found in plasma. The increase in P50 needed for Hgb in solution to effectively off-load oxygen can be best attained by incorporating mutations to make Hgb lower in affinity for oxygen.

The magnitude of change in P₅₀ with respect to chloride ion is not important physiologically. The effect, which is interesting in terms of its biochemical mechanism, does not add a significant amount of oxygen off-loading capacity to Hgb in solution. Some animal Hgb have extremely small chloride effects. Again, the chloride effect is thought to be transmitted by the alpha globin N-terminal amino group; with methionyl-Hgb that effect could be changed because of the stearic change or because the pKa of the methionine amino group is different than that of valine.

The P₅₀ of Hgb is normally most dramatically changed by hydrogen ion concentration. The so-called "Bohr effect" is also thought to, in part, involve the N-terminal amino group of the alpha globin. Numerous mutant human Hgb molecules have been shown to have altered Bohr effect changes without any physiological deficiency. There is an advantage in having a limited Bohr effect, as well as phosphate and chloride effects, in a Hgb molecule to be used in solution for a number of different medical and biochemical applications. In terms of the Bohr effect, there are several applications for methionyl-Hgb where it could be used in the more alkaline pH range, e.g., tissue culture, organ perfusion. Figure 10 depicts a preliminary experiment which indicated that the P₅₀ is actually greater for methionyl-Hgb than for Hgb Ao at greater than pH 7.8. It will be noted from FIG. 10 that the slope of the P₅₀ to pH plot for Met-Hgb is shallower than that for Hgb A_(o), i.e., the Bohr effect is smaller. Subsequent experiments suggest that the difference in Bohr effect between Met-Hgb and Hgb A_(o) is smaller than that shown in FIG. 10.

The main advantage for using a Hgb molecule with fixed changes in P₅₀ relative to pH, chloride and phosphate, is that the practitioner will know the oxygen off-loading capacity of the formulation without regard for the specific conditions of its use.

EXAMPLE 6 Synthesis of Synthetic Hemoglobin (Des-Val Hgb) of Native Size

6.1 Construction of Des-Val-All ha (pDL II-91f) and Des-Val-Beta (pDL II-95a) Globin Genes

DNA sequences encoding the globin genes in which the N-terminal valine codon in each gene is replaced by an ATG (methionine) codon were constructed in a manner analogous to t he des-FX clones except t hat the oligonucleotides inserted (FIG. 8) were ones encoding the amino acid sequences "met-leu . . . " and "met-his . . . " for alpha and beta globin genes, respectively. Following confirmation of correct sequence for both des-val alpha (pDL II-91f) and des-val beta (pDL II-95a) genes in pGEM-1, the des-val alpha globin gene was cloned into the EcoRI/PstI cut pKK-223 (see above) to create pDL III-1a. The des-val beta globin gene from PDL II-95a was then cloned into pDL III-1a using the PstI/HindIII restriction sites.

More specifically, the des-val alpha transfer vector was prepared from plasmid pDL II-62m as follows. Plasmid pDL II-62m was digested with EcoRI and PstI to excise the fragment containing the FX alpha globin gene. The FX alpha globin gene was then gel purified. The plasmid pGEM-1 (Promega Corp.) was linearized with EcoRI and PstI, gel purified, and the FX alpha globin gene ligated into the plasmid as described previously. It was necessary to subclone into pGEM-1 because redundant restriction sites in pKK 223-3 prohibited the removal of the FX coding sequence directly from the individual FX-alpha and FX-beta globin genes. Clones containing the FX alpha globin gene in pGEM-1 were identified by digestion of purified plasmids with EcoRI and PstI followed by agarose gel electrophoretic analysis. FX alpha pGEM-1 was digested with NdeI and EagI to remove the FX alpha coding sequence and oligonucleotides containing DNA sequences coding for native alpha globin in which the N-terminal valine is replaced by a methionine were synthesized with ends compatible to NdeI and EagI restriction sites. After synthesis, oligonucleotides were purified, annealed and ligated as described above. The sequence of PGEM des-val alpha (plasmid pDL II-91f) was confirmed by dideodoxy sequencing of the plasmid using the t7 primer (Promega corp., Madison, Wis.). A clone containing the correct sequence for des-val alpha (pDL II-91f) was then digested with EcoRI and PstI. The des-val alpha globin gene was gel purified, and ligated into EcoRI/PstI digested, gel purified, pKK-223-3 to generate plasmid pDL III-1a.

The des-val beta globin transfer vector was prepared in an analogous fashion using the FX beta globin gene of plasmid pDL II-10a. FX-beta was excised from pDL II-10a with PstI and HindI. This gel purified beta globin sequence was then ligated into pGEM-1 that had been digested with the same two enzymes. A pGEM clone containing the FX beta globin gene was digested with NdeI and SacII, gel purified, and used for construction of the des-val beta globin gene. The oligonucleotides encoding the desired sequence for des-val beta were synthesized, purified, annealed, and ligated into NdeI, SacII cut, pGEM FX-beta to form pGEM des-val beta (plasmid pDL II-95a). After confirming that the sequence was correct for des-val beta globin, the gene was removed with PstI and HindIII and gel purified.

6.2 Preparation of DDL III-14c and III-38b (des-Val-alpha/des Val beta Polycistronic Gene Clones)

Plasmid PDL III-1a containing the Des-Val alpha globin gene was digested with PstI and HindIII and gel purified. The Des-Val beta globin gene was removed from pDL II-95a using the same method. Following ligation and transformation, individual clones containing the Des-Val alpha/des-val beta globin coexpressing plasmid pDL III-14c were analyzed for Des-Val hgb production by IPTG induction and SDS-PAGE.

Plasmid pDL III-38b which contains the Des-Val beta globin gene 5' to the Des-Val alpha globin gene was then constructed and analyzed.

Plasmid pDL III-1a containing the des-val alpha globin gene was linearized with the restriction enzyme SmaI. The plasmid was then treated with bacterial alkaline phosphatase to remove the 5'-phosphate groups, phenol extracted, ethanol precipitated, and resuspended in TE buffer as above. A pGEM-1 clone (pDL II-95a) containing the des-val beta gene was digested with HindIII, phenol extracted, ethanol precipitated and resuspended in a ligation mixture containing a 50:1 molar ratio of a HindIII-SmaI linker to plasmid. This ligation mixture was then digested with SmaI and the beta globin fragment now containing SmaI restriction sites on both the 5'-and 3'- ends was gel purified and added to a ligation reaction containing the linearized form of plasmid pDL III-1a, above, thus obtaining plasmid pDL III-38b. The orientation of the alpha and beta globin genes in pDL III-14c and pDL III-38b was confirmed by restriction analysis.

Cells of E. coli strain JM109 were transformed with pDL III-14c or pDL III-38b and grown in 2xYT mediα containing ampicillin. Colonies were induced with IPTG as above. Individual clones were analyzed for their ability to produce des-val alpha and des-val beta globin polypeptides by SDS-PAGE and Western blotting. There was no appreciable difference between expression of immunoreactive des-val alpha or des-val beta globins from the alpha→beta orientation (pDL III-14c) or the beta→alpha orientation (pDL III-38b).

EXAMPLE 7 Analysis and Comparison of the Functional Characteristics of Recombinant Des-FX and Des-Val Hemoglobins

JM109 cells expressing either des-FX hemoglobin (dFX-hgb) or des-Val hemoglobin (dV-hgb) were grown to an OD600 of 15 in a 10 liter fermenter and then induced by the addition of 300 μm IPTG. Induction period was for 6 hrs. Cells were harvested by centrifugation and frozen at -80° C. until processed.

For purification of hgb approximately 200g of cells were resuspended in 350 mL of 50 mM sodium phosphate (NaPi) buffer, pH 7.0 containing the 200 units of aprotinin/mL and 20 μg/ml DNAase 1. Cells were then lysed by the addition of 1 mg/mL of lysozyme and 4 passages through a Dynomill. Cellular debris was removed by centrifugation at 10,000 rpm in a Beckman JA14 rotor at 4° C for 40 min. The supernatant was added to 200 mL of a hemoglobin-binding resin equilibrated with 10 mM NaPi, pH 7.0. The pH was adjusted to 7.0 with 10M NaOH or concentrated phosphoric acid. The resin was then loaded into a 5×30 cm chromatography column and allowed to settled. The column was then washed with 2.5 column volumes of 10 mM NaPi. nH 7.0 containing 100 units/mL aprotinin. Hgb was eluted from the column in 20 mM Tris-HCl, pH 7.5 containing 100 units/mL aprotinin. This partially purified hgb was then 0.2 micron filtered and loaded onto a 1.6×10 cm Mono-Q anion exchange column equilibrated with 20 mM Tris-HCl, pH 8.0. Hgb was eluted using a linear gradient of 0 to 0.4M NaCl in 20 mM Tris-HC1, pH 8.0. The material was then loaded onto a 1.6×10 cm Mono-S cation exchange column. Hemoglobin was eluted with a linear gradient of 10 mM NaPi, pH 7.0 to 10 mM NaPi, pH 8.5, 160 mM NaCl. The major peak of hgb was collected, concentrated to approximately 100 mg/mL. and used for analysis.

Functionality of the recombinant hgb was evaluated using a Hemox analyzer at 25° C. in 50 mM HEPES, pH 7.4 containing 0.1M Cl. The following oxygen binding data were obtained:

    ______________________________________                                         SAMPLE           P.sub.50                                                                              N                                                      ______________________________________                                         Ao               4.3    2.9                                                    dFX-hgb          3.3    2.6                                                    dV-hgb           6.6    2.7                                                    ______________________________________                                    

Of significance in these data are the following:

1) The addition of an extra amino acid, methionine, on the N-termini of alpha and beta globins (dFX-hgb) appears to reduce slightly the P₅₀ of the molecule but has little effect upon cooperativity (N).

2) Replacement of the N-terminal valines of alpha and beta globins with methionine (dV-hgb) increases the P₅₀ of the molecule but has little effect upon the cooperativity (N).

EXAMPLE 8 Polycistronic Co-expression of Des-Val-Alpha/Alpha (Di-Alpha) and Des-Val Beta Globin

The overall synthetic plan for the preparation of a plasmid (pDL III-47a) which co-expresses di-alpha globin and beta globin is given below. The starting materials are the commercially available transfer vectors M13mp19-RF, pKK 223-3 and pGEM-1, and the synthetic oligonucleotides described in the examples.

For convenience, our manipulations began with plasmids, pDL II-62m and pDL II-10a. Plasmid pDL II-62m was obtained by cloning the "FX alpha globin" gene into pKK 223-3 downstream of the Tac promoter. Plasmid pDL II-10 a was prepared by an analogous insertion of the "FX beta-globin" gene.

As set forth in greater detail in FIG. 12, the "FX alpha globin" operon encodes two cistrons, the first expressing an octapeptide "leader", and the second, alpha globin preceded by Met-Ile-Glu-Gly-Arg. The latter four amino acids constitute a recognition site for Factor X cleavage activity. The "FX beta globin" operon is similarly constructed.

Since the Factor X recognition site was not needed here, the genetic material was manipulated to excise the "FX" codons. (This could have been avoided by synthesizing the desired di-alpha globin and des-val-beta globin genes directly rather than using the FX-alpha and FX-beta genes of pDL II-62m and pDLII-10a.) The FX-alpha globin gene cassette was excised and cloned into pGEM-1 to obtain pGEM FX-alpha (PGEM FX-A). Similarly, the FX-beta gene of pDL II-10a was transferred to pGEM-1 to obtain pGEM FX-beta (pGEM FX-B).

The recognition site (FX)-encoding sequence could now be removed from pGEM FX-alpha and pGEM FX-beta to obtain pDL II-91f and pDL II-95a, respectively. The des-val alpha globin gene of pDL II-91f was recloned into pKK 223-3 to generate pDL III-1a, the gene being operably linked to the Tac promoter of pKK-223-3. The des-val beta globin gene of pDL II-95a was purified and inserted downstream of the des-val alpha globin gene of pDL III-1a to form a single transcriptional unit which would encode a polycistronic alpha globin/beta globin mRNA, see pDL III-14c. Finally, a synthetic oligonucleotide comprising the desired di-alpha linker encoding sequence and another copy of the alpha globin gene was inserted into pDL III-14c to create pDL III-47a, wherein a Tac promoter controls transcription of a di-alpha globin gene and a des-val beta globin gene.

8.1 Preparation of PDL III-47a (Di Alpha/Beta Globin Clone)

The EagI and PstI restriction fragment containing most of the alpha globin gene from the plasmid pDL II-91f was gel purified and ligated to a synthetic DNA linker containing the sequence from the BstBI site of the alpha globin gene to the codon (wild-type Arginine) for its carboxyl terminus, a variable glycine-encoding linker (for example, FIG. 12, RGGV, a di-glycine followed by α Val; other possibilities include RGM, RGV, RGGV, etc., See Table 200), and the codons for the amino terminal region of alpha globin to the EagI site (FIG. 12). After digesting this ligation mixture with Pst I, the resulting fragment was cloned into BstBI/PstI-cut pDL III-14C to create plasmid pDL III-47a (RGM-di-alpha). Plasmids pDL III-82a (RGGV-di-alpha), pDL IV-8a (RGV-di-alpha), pDL IV-976 (RV-di-alpha) and pDL IV-66a (RGGGV-di-alpha) were similarly constructed to incorporate the indicated changes in the di-alpha coding sequences.

8.2 Expression of Di-alpha/Beta Hemoglobin

Individual E. coli clones were analyzed by Western blotting for production of dimeric alpha globin protein in combination with monomeric beta globin. Appropriate plasmid construction was confirmed by digestion with EcoRI and 0.8% agarose gel electrophoresis. The EcoRI fragment present in the di-alpha constructs is approximately 1450 bp.

Expression of genetically fused hemoglobin was accomplished using the IPTG induction protocol and S-sepharose purification of recombinant hemoglobin. E. coli cells (400 ml) were grown to an OD₆₀₀ of 3.0 and induced with 1 mM IPTG. The cells were allowed to continue to grow for another 4 hours and then harvested by centrifugation. The cell pellet was resuspended in the 10 mM sodium phosphate, pH 6.0 containing 1 mM benzamidine, 1 mM EDTA and 0.1% Triton-X100. The cell suspension was then sonicated, centrifuged at 15,000× g for 15 minutes, and the supernatant loaded on to an S-Sepharose column equilibrated with 10 mM sodium phosphate pH 6.0. After the sample was loaded on the column, the column was washed with 10 bed volumes of 10 mM sodium phosphate pH 6.8. The dialpha hemoglobin was eluted from the column with 10 mM sodium phosphate, pH 7.4, 30 mM NaCl. Confirmation of hemoglobin production was accomplished by visible light spectroscopy, SDS-PAGE, and Western blot analysis of purified material.

Oxygen binding measurements were made at 37 and 25° C. in a Hemox Analyzer (Southampton, Pa.). The solutions were 50 mM HEPES, pH 7.4, 0.1M NaCl. The measured P50's and Hill coefficients are given below:

    ______________________________________                                                      P50                TEMP                                           Protein      (torr)      N      (°C.)                                   ______________________________________                                         RV-di-alpha  2.2         1.73   25                                                          5.0         1.49   37                                             RGV-di-alpha 4.5         2.04   25                                                          9.9         2.07   37                                             RGGV-di-alpha                                                                               8.2         2.39   25                                                          15.0        2.22   37                                             RGGGV-di-alpha                                                                              7.1         2.57   25                                             ______________________________________                                    

EXAMPLE 9 Preparation of Di-Alpha Hemoglobin Low Affinity Mutants

In order to reduce the oxygen affinity of recombinant di-alpha hemoglobin, several mutations were introduced into the beta globin polypeptides using synthetic oligonucleotides. The restriction sites used to incorporate these mutants are shown in Table 3. For insertion of the Nagai (beta Val 67→Ile) and Arg-Nagai (also beta Lys 82→Arg) mutations, the des-Val-beta plasmid pDL II-95a was digested with the restriction enzymes NcoI and KpnI and gel purified. Oligonucelotides spanning these two restriction sites and containing the appropriate codon changes were synthesized, purified, annealed and ligated into the gel purified plasmid. Following confirmation of correct sequence, the mutant des-Val-beta globin gene was excised with PstI, HindIII, gel purified, and cloned into plasmid pDL III-47a. The beta globin gene containing the Kansas mutation (beta Asn 102→Thr) was similarly constructed using SacI and SpeI restriction sites. Mutated codons for all of these beta globin mutations are shown in lower case letters in Table 3.

EXAMPLE 10 Characterization of Di-Alpha Hemoglobins Oxygen Binding

Oxygen binding measurements were made at 37° C. in a Hemox Analyzer (Southampton, Pa.). The solutions were 50 mM Bis-Tris, pH 7.4, 0.1M NaCl and 60 μM heme equivalents of di-alpha hemoglobin. The solutions were measured between 120 and 1.5 torr oxygen pressure. P₅₀ values are given in Table 4.

In Vivo Half Life

Di-Alpha Hgb (wild type) containing a gly-gly linker between alpha₁ and alpha₂ (RGGV-di-alpha, pDL III-82a) was prepared as described previously. The protein was formulated in 20 mM NaPO₄, pH 7.4 at a concentration of 95 mg/ml. Di-alpha Hgb was infused into male Sprague-Dawley rats 388-426 gm) at a dose of 875 mg/kg through a central venous catheter over 20-30 sec. Samples of blood were drawn at 2, 30, 80, 90, 120, 150, 180, 210, and 240 min. into heparinized vials. The blood was centrifuged to remove red blood cells and the plasma hemoglobin was assayed by absorbance at 540 nm. The percent hemoglobin remaining versus time was determined by comparison to the 2 min time point which was assumed to be a homogeneously mixed sample. The same experiment was repeated with non-crosslinked des-val Hgb at a concentration of 100 mg/ml. The data were averaged for each sample and plotted as percent Hgb remaining against time after infusion. The measured half-lives were 205 min and 104 min respectively for di-alpha Hgb and des-val Hgb, respectively.

EXAMPLE 11 cl Co-expression of Wild Type Des-Val Alpha Globin and Di-Alpha Globin with Des-Val Beta Globin Containing the Presbyterian Mutation

Plasmid pSGE1.0-E4 is shown in FIG. 34 and contained the following modifications as compared to other expression vectors derived from plasmid pKK223-3:

1) The plasmid now contains a functional tetracycline resistance gene.

2) The laci gene which encodes for the lac repressor protein has been incorporated into the plasmid. The lac repressor protein represses the TAC promoter until induction with IPTG. The repressor gene was inserted into the plasmid to permit transformation of E. coli cell lines which do not have endogenous lac repressor genes.

The desVal beta globin gene in pSGE1.0E4 containing the Presbyterian mutation had been constructed by insertion of a complementary pair of synthetic oligonucleotides between the Sacl and Spel restriction sites of pDL II-95a.

The following oligonucleotides (Pres-A and Pres-B) were used to construct the Presbyterian mutation in dVal-beta globin. ##STR5##

Following digestion with the two restriction enzymes pDLII-95a was gel purified to remove the wild type encoding DNA fragment. The annealed oligonucleotides were then ligated into the plasmid to create plasmid pJRV-83a. The HindIII to PstI fragment from this plasmid was then ligated into pSGE0.0E5 (Table 200, #45a) to create pSGE1.0E5(Table 200, #46). The BamHI fragment encoding the tet R 5' end was then ligated into pSGE1.0E5 to create pSGE1.0E4. Following transformation of JM109 cells, individual colonies were selected and analyzed for production of alpha and beta globins by IPTG induction and SDS-PAGE. Dideoxynucleotide sequencing was used to confirm the presence of the Presbyterian mutation.

The mutant hemoglobin was produced, purified, and analyzed as described in Example 7 above. The oxygen affinity (P₅₀) and cooperativity (N) of wild-type Hgb A₀ (purified from blood), dV-Hgb expressed from pSGE0.0E4 (FIG. 14), and dV-Hgb containing the Presbyterian mutations expressed from pSGE1.0E4 (FIG. 34), were compared:

    ______________________________________                                         SAMPLE            P.sub.50                                                                              N                                                     ______________________________________                                         Ao                4.3    2.9                                                   dV-hgb            6.6    2.7                                                   dV-hgbPres        19.8   2.5                                                   ______________________________________                                    

It should be noted that the Presbyterian mutation, which results in the change of beta asparagine 108 to lysine, cause a large decrease in the affinity of the molecule for oxygen but does not substantially affect the cooperativity of the molecule.

The beta chain Presbyterian mutation has also been co-expressed with di-alpha globin containing a single glycine linker utilizing plasmid pSGE1.1-E4 (FIG. 15). The result shown below indicates that the joining of the carboxy terminus of alpha 1 to the amino terminus of alpha 2 has relatively little effect on oxygen binding and cooperativity.

    ______________________________________                                         SAMPLE            P.sub.50                                                                              N                                                     ______________________________________                                         dV-hgbPres        19.8   2.5                                                   dialpha/Pres      17.2   2.4                                                   ______________________________________                                    

EXAMPLE 12 Construction of a Two Promoter System for the Co-expression of Di-Alpha Globin and Beta Globin

In this example, a di-alpha globin gene is operably linked to one promoter and a beta globin gene to a second promoter, but both genes reside on the same vector. Compare Examples 16 and 17, infra.

Oligonucleotides (see below) encoding the sequence of the complementary strands of the TAC promoter (syn pTAC) and appropriate restriction enzyme sites were synthesized, gel purified, and annealed. Notice that the sequence complementary to the Xbal restriction enzyme site was designed to eliminate this restriction site when ligated into an authentic Xbal site. This was done to facilitate future manipulation of syn pTAC.

Sequence of syn pTAC ##STR6##

Plasmid pDL III-47a (FIG. 13) was digested with the restriction enzymes BamHl and Xbal, and the plasmid, now containing as an insert only part of the beta globin gene from the Xbal site to the HindIII site (FIG. 13a), was gel purified.

Syn pTAC was then ligated into the plasmid to create plasmid pDL IV-64a and JM109 cells were transformed. Individual transformants were isolated and analyzed by IPTG induction and SDS-PAGE to confirm the presence of a functional TAC promoter.

Plasmid pDL III-47a was digested with the restriction enzymes Pstl and HindIII and gel purified to remove the beta globin coding sequences. Plasmid pDL IV-64a was digested with these same enzymes and the fragment encoding syn pTAC/beta gel purified. Ligation of the syn pTAC/beta fragment into the Pstl/HindIII digested pDL III-47a created plasmid pDL IV-67a (FIG. 16). Individual transformants were screened for production of di-alpha and beta by IPTG induction and SDS-PAGE.

The dVal-beta globin gene under the control of syn PTAC was also ligated into another location in pDL III-47a. Syn pTAC/beta was removed from pDL IV-64a by digestion with Hind III and Pstl. The restriction site overhangs were filled with T4 polymerase and blunt end ligated into the Pvu II site of PDL III-47a to create plasmid pJR VI-54a (FIG. 17). IPTG induction and SDS-PAGE analysis were as previously described.

Results of induction experiments as evaluated by SDS-PAGE indicated that control of expression of di-alpha and beta by separate promoters gave little increase in the expression of either protein. Similarly, insertion of a second beta globin gene under regulation of a separate promoter had little effect upon production of the proteins.

EXAMPLE 13 Insertion of a Second Translationally Coupled Beta Globin Gene Into the Di-Alpha/Beta Expression Plasmid

The expression plasmid SGE1.1E4 (FIG. 15) contains a di-alpha globin gene followed by a single beta globin gene. Translation of the polycistronic mRNA results in the non-equimolar production of two alpha globin proteins and one beta globin protein. Hemoglobin is composed of equimolar quantities of alpha and beta globin, suggesting that this expression system may underproduce beta globin. In order to test this hypothesis, we inserted a second beta globin gene in front of the di-alpha globin gene and analyzed expression of recombinant hemoglobin from this plasmid. Expression of all genes from a single promoter ought to exert the greatest control over stoichiometry.

Construction of a di-alpha/beta expression plasmid which results in equimolar synthesis of alpha and beta globin proteins and synthesis of genetically stabilized hemoglobin which cannot dissociate into alpha/beta dimers is novel. To our knowledge no such hemoglobin molecule has ever been produced or shown to function as an oxygen carrier.

Plasmid pSGE1.1E4 (FIG. 15) was digested with the restriction enzymes HindIII and PstI. The beta globin gene was gel purified, T4 polymerase filled, and ligated into SmaI linearized pSGE1.1E4. Following ligation, the mixture was again digested with Smal to linearize plasmids not containing a second beta globin gene. The mixture was then transformed into E. coli strain JM109. Plasmids from individual isolates were screened for the presence of a second beta globin gene by digestion with the restriction enzyme ScaI. An individual isolate was designated SGE157. The plasmid was designated pSGE1.1.1E4 (FIG. 29).

A 2 liter fermentation was performed to obtain material for purification and functionality determinations. Hemoglobin in crude cell extracts, quantitated by the difference assay, was 2.8 mg/gm cell paste. This assay allows us to measure the concentration of functional hemoglobin. The addition of a second beta gene appears to have no significant effect on the production of functional hemoglobin.

EXAMPLE 14 Construction of Di-Beta Globin Expression Vector and Production of Di-Beta Hemoglobin

The general flowchart for this protocol is shown in FIG. 30.

a. Elimination of the NheI Site from Plasmid pGEM-1

Plasmid pGEM-1 was digested with restriction enzyme NheI, T4 polymerase filled, ligated, redigested with NheI, and transformed into E. coli strain SGE127.

Strain SGE127 was obtained by screening JM109 for phage resistance.

Plasmids were isolated from individual colonies and screened for loss of the NheI site by digestion with NheI. An E. coli isolate containing the appropriate plasmid was grown up. The plasmid was purified and designated pSGE223.

b. Cloning of Beta Asn¹⁰⁸ →Lys into PSGE223

The gene for beta globin containing the Presbyterian mutation, ASN¹⁰⁸ →Lys, was excised from plasmid pSGE1.1E4 (FIG. 15) with the restriction enzymes PstI and HindIII and gel purified. Plasmid pSGE223 was similarly digested and gel purified. The Beta fragment was ligated into pSGE223 and transformed into SGE127. Individual isolates were screened for the presence of the Beta globin fragment by PstI and HindIII digestion. An E. coli isolate containing the appropriate plasmid was grown up. The plasmid was purified and designated pSGE224.

c. Insertion of the Di-Beta Linker Sequences

Plasmid pSGE224 was digested with the restriction enzymes BamHI and SacII and gel purified. Annealed complementary synthetic oligonucleotides encoding the 2, 3, 4, or 5 glycine (GGT codon) linker sequences were ligated into the plasmid and transformed into SGE127.

Isolates were screened for the presence two NheI sites. E. coli isolates containing the appropriate plasmids were grown up. The plasmids were purified and designated pSGE 229 (5 glycine linker), pSGE232 (4 glycine linker), pSGE233 (3 glycine linker), or pSGE231 (2 glycine linker).

d. Construction of the Di-Beta Globin Gene Containing the 5 Glycine Linker

Plasmid pSGE 229 was digested with NheI. The fragment containing the 5 glycine linker and the beta globin gene was gel purified and ligated into NheI digested plasmid pSGE224. Individual isolates were screened by digestion with the restriction enzymes HindIII and PstI. An appropriate isolate was grown up. The plasmid was purified and designated PSGE 234.

e. Construction of pSGE 0.0E5

Plasmid pSGE0.0E4 was digested with BamHI to remove the 5' coding sequences for the tet resistance gene. After gel purification, the plasmid fragment was re-ligated and transformed into E. coli strain JM109. Isolates were selected for resistance to ampicillin and sensitivity to tetracycline. An individual isolate was grown up, the plasmid purified and digested with BamHI to confirm loss of the BamHI fragment. The isolate was designated SGE0.0E5. The plasmid was designated pSGE0.0E5.

f. Construction of pSGE1.05E5

Plasmids pSGE234 and pSGE0.0E5 were digested with HindIII and PstI. The di-beta-containing fragment was gel purified from pSGE234 and ligated into gel purified pSGE0.0E5 from which the beta fragment had been removed. Following transformation, individual isolates were screened by SDS-PAGE for their ability to produce alpha and di-beta globin proteins subsequent to induction with IPTG. An appropriate isolate was grown up and designated strain SDGE242. The plasmid was designated pSGE1.05E5.

g. Construction of PSGE1.05E4

The BamHI fragment containing the 5' end of the tetracycline resistance gene was gel purified from plasmid pSGE1.1E4 and ligated into BamHI-linearized plasmid pSGE1.05E5. Isolates were selected by plating the transformation mixtures on tetracycline containing nutrient agar plates. Individual isolates were screened for their ability to produce alpha and di-beta proteins subsequent to IPTG induction. An individual isolate was grown up and designated SGE245. The plasmid was designated pSGE1.05E4 (FIG. 31).

Production of di-beta hemoglobin (5 glycine linker) was achieved with this expression vector in E. coli but:

a. expression levels were low compared to our di-alpha/beta expression system (4 to 5 fold lower yield).

b. the oxygen affinity of the alpha/di-beta hemoglobin was high P₅₀ =5.9 torr (±0.6 torr) vs. ⁻⁻ 32 torr for the di-alpha/beta molecule. (at 37° C.) However, cooperativity, as measured by the Hill constant (n) was similar to the di-alpha construct (n=2.0±0.2 for both di-beta and di-alpha).

Although a molecule with such a low P₅₀ may not be useable as blood substitute or an oxygen-carrying plasma expander, it may have other uses. For example, it may be particularly useful for delivering oxygen to regions of tumors where oxygen tension is particularly low. Enhanced oxygen delivery during radiation or chemotherapy is known to enhance the therapeutic effects of anti-neoplastic agents:

REFERENCES

(1) Dowling, S., J. J. fischer and S. Rockwell. (1991) A clinical study of fluosol and hyperbaric oxygen as an adjunct to radiation therapy. Biomat. Art. Cells and Immobil. Biotech 19, 377.

(2) Herman, T. S. and B. A. Teicher (1991) Enhancement of radiation therapy by an experimental concentrated perfluorooctylbromide (Oxygent) emulsion in the Lewis lung carcinoma. Biomat. Art. Cells and Immobil.

Biotech. 19, 395.

(3) Holden, S. A., B. A. Teicher and T. S. Herman (1991) Effect of a PFOB Emulsion (Oxygent) and carbogen breathing on the tumor cell survival of the FSaIIC fibrosarcoma after treatment with antitumor alkylating agents. Biomat. Art. Cells and Immobil. Biotech. 19, 399.

EXAMPLE 15 Hypothetical Protocol for Development of Linkers by Mutation and Selection

In this hypothetical example, the linker suitable for genetic fusion of globin subunits is obtained by mutagenesis of a linker-encoding DNA sequence and selection for functional linkers. Oligonucleotides spanning the BstBI site of alpha₁ to the EaqI site alpha₂ will be synthesized such that the six nucleotides comprising the preferred glycine-glycine linker are randomized. By randomizing these nucleotides, codons for all combinations of amino acids will be present in the oligonucleotide mixture. Following purification and annealing of the oligonucleotides they will be used to construct the di-alpha/beta co-expression genes as described above.

Clones containing the various di-alpha/beta plasmids will then be screened for production of increased levels of recombinant hemoglobin using a protocol developed at the Company. E. coli clones will be arrayed on nitrocellulose filters overlayed on 2xYT-ampicillin plates containing 1 mM IPTG. Following overnight incubation of 37° C., the plates will be sealed in a plastic bag in which the air has been displaced by carbon monoxide (CO). CO binding to intracellular recombinant hemoglobin produces a distinctive red color in the E. coli colonies. Colonies producing the most intense red color will be further analyzed.

In this experiment the assumption is made that certain combinations of amino acids in the di-alpha linker will permit more stable folding of the individual, linked alpha globin chains and, therefore, result in greater levels of production of intracellular recombinant hemoglobin. This increase level of production will result in a more intense red color in the appropriate E. coli clones.

After selection of several clones producing higher levels of recombinant hemoglobin, more detailed analyses will be done on individual clones to determine the optimal di-amino acid linker. The analyses will include determination of quantities of recombinant hemoglobin produced, oxygen affinity, and protein stability. Finally, clones found to be producing the best quality recombinant hemoglobin will be DNA sequenced to determine the amino acids comprising the linker.

This technique is equally applicable to other globin pseudodimers (di-beta and alpha/beta).

EXAMPLE 16 Hypothetical Protocol for the Synthesis of Plasmids Containing Alpha and Beta Globin Genes under the Regulation of Two Separate Promoters on the Same Plasmid

It is anticipated that recombinant hemoglobin can be expressed from constructs where the different globin genes are under the control of separate promoters. This situation would yield two separate mRNA's; one with a dicistronic sequence encoding an alpha globin gene and another with a dicistronic sequence encoding a beta globin gene. For construction of an expression system in which both the alpha and beta globin genes are under the regulation of separate promoters, on the same plasmid, the following protocol would initially be used. Plasmid pDL III-1a containing the des-val alpha globin gene would be digested with the restriction enzyme BamHI, reacted with bacterial alkaline phosphatase, phenol extracted, ethanol precipitated, and resuspended in TE buffer. The plasmid pJR IV 50-a which is the pKK expression plasmid containing the des-val beta construct would then be digested with the restriction enzymes BamHI and PvuI, to excise a fragment from the plasmid containing the P_(tac) promoter, the des-val beta sequence, the transcriptional terminator sequence and a portion of the ampicillin resistance gene. Following gel purification of this fragment, a PvuI-BamHI linker would be synthesized and ligated onto the insert. The insert would then be back-cut with BamHI to generate BaMHI compatible sites on both the 5'- and 3'- ends of the insert. This insert would then be cloned into BamHI linearized plasmid pDL III-1a, resulting in a plasmid in which a translationally coupled des-val beta globin gene under regulation of one P_(tac) promoter is positioned on the 3' side of a translationally coupled des-alpha globin gene under regulation of a separate P_(tac) promoter. Restriction enzyme mapping would be used to confirm the orientation of the beta globin containing insert. E. coli JM-109 would be transformed with the plasmid containing the separate P_(tac) -globin constructs and grown in mediα containing ampicillin to isolate clones containing the plasmid. The clones containing the plasmid would then be induced with IPTG and expression of des-val alpha globin, des-val beta globin and des-val hemoglobin would be assayed by SDS-PAGE analysis, Western blotting with anti-hemoglobin antibodies and isolation of des-val hemoglobin by standard chromatographic methods.

Alternatively, coexpression of both globin genes could be achieved from DNA sequences on separate vectors under the control of separate promotors.

EXAMPLE 17 Hypothetical Protocol for the Construction of Vectors Containing Alpha and Beta Globin Genes Under the Regulation of Separate Promoters and on Different Vectors

E. coli clones containing plasmid pDL III-1a which is the pKK223-3 plasmid containing a dicistronic loader gene/des-val alpha construct under the regulation of the P_(tac) promoter would be transformed with a plasmid containing the dicistronic loader gene/des-val beta construct under control of the same promoter, but with a gene conferring additional antibiotic resistance to tetracycline. This could be constructed in the following manner: Plasmid pJR IV-50a contains the des-val beta globin gene under control of the P_(tac) promoter. This plasmid would be cut with PvuII to generate a linear plasmid with blunt ends. This would be ligated with a NotI phosphorylated linker (New England BioLabs). The ligation mixture will be used to transform E. coli. Plasmid DNA would be prepared and plasmids containing a NotI site identified by digestion with NotI agarose gel electrophoresis. This plasmid will contain the P_(tac) :des-val beta globin sequence. The gene for resistance to .he antibiotic kanamycin is commercially available (Pharmacia) and contains EcoRI restriction sites on both ends. The ends will be converted to blunt ends by treatment with T4 DNA polymerase by the method of Maniatis, et al. The resulting fragment will be ligated with a 50 fold excess of phosphorylated NotI linker (25° C., 60 min). The ligation reaction would be made 0.01M in EDTA, heated to 70° for 20 min and ethanol precipitated. The precipitated DNA will be taken up in 100 ul of NotI buffer and treated with 100 units of NotI (37°) for 2 hr. The fragment would be purified by agarose gel electrophoresis. The NotI adapted kanamycin resistance gene would then be ligated into the NotI linearized pJR IV-50a to yield a plasmid with the gene for des-val beta globin under control of P_(tac) with kanamycin resistance. E. coli JM-109 clones containing plasmid pDL III-1a, the plasmid containing des-val alpha globin under control or P_(tac) with resistance to ampicillin, will then be transformed with kanamycin resistant plasmid containing the gene for des-val beta globin and clones will be selected for resistance to both ampicillin and kanamycin. Other antibiotic resistance genes could be used as well. Expression of alpha and beta globin polypeptides under the regulation of separate promoters will then be analyzed by IPTG induction, SDS page and western blotting.

Should we encounter problems with plasmid exclusion, we could use the same strategy with the pIN plasmids that have been used to express polypeptides from separate plasmids in E. coli (McNally, et al., PNAS 85, 7270, 1988).

One potential problem that we may face with creating plasmids in which the alpha and beta globin genes are under separate but identical promoters is the possibility of homologous recombination within the identical sequences on the plasmids, eg. the promoter region. This could result in deletion of a segment of important DNA sequence. It is therefore preferable to use different, non-homologous promoters for each different globin gene, eg. P_(tac) and _(Ptrc), or the lambda P_(L) promoter in appropriate host (containing cI857).

EXAMPLE 18 Synthesis and Assembly of the Di-Alpha Beta Globin Construct in aP_(L) Regulated Vector System

In prior examples, the globin genes were under Tac promoter control, and the alpha (or di-alpha) and beta globin genes were each translationally coupled to the ribosomal loader cistron taught by Schoner, et al. In this example, the lambda P_(L) promoter and a different translational coupler (see below) are used.

Translational Coupler ##STR7##

The pL expression system has a different translational coupler as compared to the pTAC system. Sequences coding an SD and a translational stop were added downstream from the 5' end of the N protein coding sequences to act as a translational coupler. Subsequent to that the globin coding sequences are identical to that used in the pTAC system.

Using the pPL-lambda vector available from Pharmacia, a plasmid construct was assembled to generate a genetically crossed-linked tetrameric human hemoglobin (wild type) in E. coli strains N99Ci+ and N4830-1 (cI857). These bacterial strains were obtained from Pharmacia and are inducable by addition of naladixic acid (40 μg/ml) or mitomycin C (10 μg/ml) in the presence of the wild type Ci+ repressor or heat treatment of the strain containing the cI857 repressor gene (Mott, et al., PNAS 82, 88, 1985 and Gottesmann, M. E., et al., J.Mol. Biol. 140, 57, 1980). A diagramatic representation of the cloning strategy is depicted in FIG. 19.

Removal of the EagI Site from the pPL-Lambda Vector

Removal of the EagI site from pPL-lambda was necessary to enable cloning of the di-alpha gene sequence, because both alpha structural genes contain a EagI site located 6 bp into the coding sequence. The pPL-lambda vector was digested with EagI, and the ends were filled using T4 DNA polymerase. The Bam HI linker (5'-CCCGGATCCGGG-3')(Pharmacia), was blunt-end ligated to the EagI digested pPL-lambda plasmid by standard methods. This eliminated the EagI site in the desired construct. The resulting mixture was digested with EagI to eliminate any plasmids still containing the EagI site. E. coli N99Ci+cells were transformed with resulting plasmid, pPL-lambda-E. Clones containing the desired plasmid were identified by restriction digest analysis.

Incorporation of the Synthetic Co-Translational Coupler into pPL-Lambda-E

Prior to inserting the globin genes into the vector it was necessary to incorporate the synthetic translational coupler sequence into the HpaI site of pPL-lambda-E. This was done by digestion of pPL-lambda-E with HpaI followed by blunt-end ligation of the co-translational coupler into the HpaI site of the vector. Ligation of the coupler to the blunt end resulted in destruction of the HpaI site. The ligation mixture was treated with HpaI to digest any plasmid remaining containing the HpaI site. E. coli N99Ci+ cells were transformed with the resulting reaction mixture. Clones were screened with EcoRI and Hind III restriction digests to identify clones containing the co-translational coupler in the proper orientation. DNA fragments of 522 bp and 4762 bp were observed for plasmid containing the desired orientation. To confirm the orientation of the coupler, the resulting plasmid was sequenced using a primer (5'CAATGGAAAGCAGCAAATCC-3') complementary to the sequence 30 base pairs upstream from the translational coupler sequence. The desired plasmid was denoted as pPL-lambda-E+TC.

Construction of an Expression Plasmid Containing Des-Val Alpha and Beta Genes Under Control of pPL-Lambda

The Des-Val alpha and beta globin genes were obtained from pDL III-14c by digestion with EagI and Hind III, followed by agarose gel purification of the desired 942 bp segment. The purified alpha and beta globin gene fragment was cloned into EagI and HindIII digested pPL-lambda-E+TC. The ligation mixture was used to transform E. coli N99Ci+, and clones were screened for the presence of the desired plasmid, pPL-alpha/beta. Digestion with EcoRI (4758 and 1468 bp), and with PstI and HindIII (4005, 1775, 52u bp) confirmed presence of the desired restriction sites. Further confirmation was obtained through sequencing with the 20 bp primer, above, to confirm the sequence between the co-translational coupler and the Des-Val alpha globin gene, and a second primer (5'ACCCGGAAAACTTCCGTC-3') to confirm the sequence between Des-Val beta and the pPL vector.

Construction of an Expression Plasmid Containing Di-Alpha and Beta Globin Genes under Control of pPL-Lambda

An RGV crosslinker which encodes for the carboxy terminal portion of alpha globin, linked via a single glycine residue to the native sequence of the amino portion of a second alpha globin chain, was prepared by separately phosphorylating the 5' ends of 5'CGAAATAACGTGGTGTTCTGTCTGC-3' and 3'TTTATGGCACCACAAGACAGACGCCGG-5' with T4 kinase, followed by annealing. This double stranded oligonucleotide was cloned onto the EagI end of a purified fragment of Des-Val alpha and beta globin prepared from pDL III-14c digested with EagI and HindIII, as described above. The linear DNA sequence generated from this ligation, now containing sticky ends coding for BstBI and HindIII restriction site sequences, was purified by agarose gel electrophoresis and cloned into BstBI and HindIII digested pPL-alpha/beta. The new plasmid designated pPL-dialpha/beta contained a sequence with a co-translational coupler upstream to a sequence containing di-alpha globin linked via a glycine residue, followed by a cotranslational coupler adjacent to a beta globin gene sequence, all under control of a single PL promoter. These clones were identified through screening minipreps with EagI restriction digestions. Clones without the second alpha globin gene merely linearised upon digestion, whereas clones containing the second gene released 431 bp and 6222 bp DNA fragments.

Construction of the Expression Plasmid pSGE0.1-L0 Containing a ROP- Origin of Replication Mutation pPL-dialpha/beta was digested with PvuII then treated with T4 DNA polymerase to fill in the sticky ends. The linearized plasmid was then blunt end ligated with a NotI linker (Promega Corp., Madison, Wis.) (5'TTGCGGCCGCAA-3'). The ligation mixture was then treated with PvuII to remove any remaining plasmid containing the PvuII site. E. coli were transformed with pSGE0.1-L0 and positive clones were identified by the presence of the unique NotI restriction site.

Expression of Hemoglobin SGE0.1 in E. Coli Under the Control of the Lambda P_(L) Promoter

E. coli N99Ci+ and E. coli N4830-1 were transformed with pSGE0.1-L0 and grown on agar plates containing ampicillin, as described previously. These E. coli strains contain the cI+ repressor gene and the cI857 heat sensitive repressor gene, respectively.

Inocula of N99Ci+ were grown at 37° C. in TB mediα to an OD₆₀₀ of ⁻⁻ 1.0 and induced with naladixic acid (40 μg/ml). Cultures were incubated for 4-6 hrs. at 37° C. before the cells were harvested. Hemoglobin production was estimated by SDS-PAGE analysis of total cell protein and by western blot analysis. By these techniques, SGE0.1 was estimated to be produced at ⁻⁻ 0.02% of the total cell protein in this cell line. Hemoglobin (57 μg) was isolated by Mono Q chromatography and shown to have an optical spectrum representative of that for normal hemoglobin.

Inocula of N4830-1 were incubated at 30° C. in TB media to an OD₆₀₀ of 1.0 and induced by addition of sufficient preheated TB mediα (65° C.) to raise the temperature of the inocula TO 42° C. The culture was then incubated at 42° for 4-6 hrs. Total cell protein analysis with SDS-PAGE revealed that SGE0.1 was being synthesized at 0.4% of the total cell protein, corresponding to 0.18 mg protein per gram of wet cell paste. SGE0.1 prepared from a 2 L preparation was purified as described elsewhere, and resulted in isolation of 7.6 mg of purified material. SGE0.1 (7.6mg) was isolated and had a P₅₀ of 4.1 (Hemox Analyzer, pH 7.4, 0.1M NaCl, 37° C.), and an optical spectrum representative of native hemoglobin.

EXAMPLE 19 Production of Hemoglobin in Yeast

All restriction enzymes and DNA-modifying enzymes were purchased from BRL, New England Biolabs, IBI, Pharmacia or Boerhringer-Mannheim. The concentrations of enzymes used were those suggested by the supplier to produce a complete reaction in 30 minutes. The buffers and conditions for the use or these enzymes were those provided with the enzymes, unless otherwise stated. Plasmid DNA was purified from E. coli DH5α as described by Birnboim and Doly (Nucleic Acids Research 1979, 7:1513-1520). Electrophoretic analysis of DNA was carried out in agarose gels using tris-acetate electrophoresis buffer (Maniatis et al. Molecular Cloning, Cold Spring Harbor, N.Y., 1982). DNA was visualized by staining the gels with 0.5 μg/ml ethidium bromide and exposing the gel to ultraviolet light. DNA fragments were purified from agarose gels using a kit purchased from BIO-101. DNA fragments were purified from acrylamide gels by crushing the excised gel fragment, containing the DNA of interest, in 3.25M ammonium acetate and incubating overnight at 37° C. Gel fragments are removed by centrifugation (12,000× g, 15 min) and the DNA precipitated with 2 volumes of 95% ethanol, 5% isopropanol. The precipitate is dried in vacuo and dissolved in 0.1XTE (LXTE is 10 mM Tris. HCl pH 7.8, 1 mM Na₃ EDTA). Acrylamide gel electrophoresis of DNA was done as described by Maniatis, et al. (Molecular Cloning, Cold Spring Harbor, N.Y., 1982.) in tris-acetate electrophoresis buffer. Bacteriological growth mediα and DNA transformation methods are described by R. W. Davis et al. (Advanced Bacterial Genetics, Cold Spring Harbor Laboratory, New York, 1980, p140-141). Transformation of S. cerevisiae with linear or circular DNA was carried out as described by H. Ito et al. (J. Bacteriology 153:163-168 (1983)). Transformants were selected on SD medium lacking uracil or tryptophane (SD-ura, SD-trp) depending on the selectable marker on the plasmid (F. Sherman et al., Methods in Yeast Genetics: A Laboratory Manual, Cold Spring Harbor Laboratory, 1979). All other yeast mediα used are described by Sherman et al. (ibid.)

SYNTHESIS AND ASSEMBLY OF A GALACTOSE REGULATED PROMOTER

This synthetic promoter consists of two functional parts, a regulatory sequence and sequence that allows efficient initiation of mRNA synthesis. One of the regulatory regions we chose includes the nucleotide sequence that confers positive regulation of transcription in the presence of galactose (M. Johnston and R. Davis, 1984. Molecular and Cellular Biology 4:1440-1448; L. Guarente et al., 1982, Proc Nat Acad Sci (USA) 79:7410-7414.). The transcriptional initiation site is derived from the consensus sequence for the S. cerevisiae glyceraldehyde-3-phosphate dehydrogenase gene (GAP491) (L. McAlister and M. J. Holland, J. Biol Chem 260:15019-15027, 1983; J. P. Holland et al., J. Biol Chem 258:5291-5299, 1983).

The synthetic oligonucleotides shown in FIG. 1 were synthesized on a Biosearch 8600 DNA synthesizer. Each oligonucleotide was cleaved from its support column with 28% NH₄ OH. The blocking groups were removed by incubating the cleaved oligonucleotide in 28% NH₄ OH at 65° C. for ≧16hr. All oligonucleotides were purified by preparative polyacrylamide gel electrophoresis in slabs of 10% acrylamide (19:1 acrylamide:bis-acrylamide) containing 7M urea. oligonucleotides were eluted from acrylamide slices by incubation (16hr) in 50 mM ammonium acetate (pH 7.4), 2.5 m magnesium acetate, 0.25 mM EDTA and 0.25% SDS at 37° C. Acrylamide fragments were removed by centrifugation (14,000× g, 10 min) and the oligonucleotide precipitated from the aqueous phase by the addition of NaCl to 0.25M and 3 volumes of 100% ethanol. The precipitated oligonucleotides were collected by centrifugation (14,000× g for 30 min), washed twice with 80% ethanol, once with 100% ethanol and the pellets dried. Pellets were dissolved in 0.1 XTE. 2×10⁻¹⁰ moles (each) of oligonucleotides 1-5 (Table 5) were phosphorylated in 0.02 ml of 0.066M Tris HCl (pH 7.6), 0.01M MgCl₂ 0.002M dithiothreitol (DTT), 0.001M spermidine and 10 units of T4 polynucleotide kinase. Phosphorylation reactions were carried out at 37° C. for 30 min and terminated by heating to 96° C. for 5 min. 2×10⁻¹⁰ moles of oligonucleotides 1 and 6 (Table 5) were added to the phosphorylated oligonucleotides in a final volume of 0.04 ml of 0.07M Tris HCl (pH 7.6) 0.01M MgCl₂. The mixture of oligonucleotides 1-6 (Table 5) was heated to 96° C. for 5 min, 75° C. for 20 min, 55° C. for 30 min, 37° C. for 60 min and 25° C. for 15 min. T4 DNA ligase (10 units), DTT (0.002M final concentration), and ATP (0.001M) were added and the mixture incubated at 4° C. for 16hr. The resulting 210 bp oligonucleotide contains a 5' end compatible with a SalI restriction endonuclease site and a 3' end compatible with an XbaI site. Because the oligonucleotides comprising the two ends of the intact oligonucleotide were not phosphorylated, they cannot ligate to each other. This oligonucleotide was cloned into the vector pSK+(Stratagene, Inc.) (FIG. 21(a)) that had been digested with XbaI and SalI. The ligation mixture contained 50 ng of XbaI, SalI digested pSK(+), and 5 pMoles of the ligated oligonucleotide in a volume of 0.001 ml. E. coli DH5α was transformed with a portion of the ligation reaction and clones that contain inserts were identified by screening for white colonies on LB-ampicillin (0.15 mg/ml) agar plates supplemented with XGAL (4 μg/ml). Positive identification was made by preparing plasmid DNA from these isolates and digesting with XbaI and SalI. The restriction digests were analyzed by agarose gel electrophoresis and three clones containing a fragment of the expected size (⁻⁻ 250 bp) were identified. The DNA sequence of all three clones was determined and one was chosen for further use and designated pGS2488 (FIG. 21(a)).

ASSEMBLY OF THE SYNTHETIC GALACTOSE UPSTREAM ACTIVATOR (GAL_(UAS) SEQUENCE

The oligonucleotides shown in Table 6 were synthesized and purified as described above. Oligonucleotides 2-5 were phosphorylated, annealed with oligonucleotides 1 and 6 and ligated as described for the assembly of GAP. The full length oligonucleotide generated by this protocol has non-phosphorylated ends compatible with the restriction endonuclease sites generated by SphI and SalI. However, when the oligonucleotide is ligated to a SalI site the resulting junction formed between the two fragments will no longer contain a cleavable SalI site.

The GA AS is contained on an SPHI-SalI fragment. To clone this fragment into pGS2488 required that we change the KpnI site of this plasmid to an SphI site. The plasmid pGS2488 was modified by cleaving with KpnI. The KpnI digested plasmid was incubated with 2 units of T4 polymerase in 0.05 ml buffer containing 50 μM of each deoxyribonucleotide triphosphate (A,G,C,T), 0.033M Tris-acetate (pH 7.9), 0.066M potassium acetate, 0.01M magnesium acetate, 0.5 mM DTT and 100 μg/ml bovine serum albumin (BSA). Na₃ EDTA was added to 0.015M and the mixture extracted 1× with phenol-chloroform. DNA was precipitated with ethanol. The dry pellet was dissolved in 0.008 ml of T4 DNA ligase buffer, 50 ng phosphorylated SphI linkers (New England Biolabs) and 10 units of T4 DNA ligase. The mixture was incubated for 1 hour at 25° C. and used to transform E. coli DH5α. Plasmid DNA was prepared from 12 transformants and tested by restriction enzyme digestion and agarose gel electrophoresis, for the presence of an SphI site and the absence of a KpnI site. A clone containing a plasmid with these characteristics was identified and the plasmid was designated pGS2888 (FIG. 21(b)).

The next step in the assembly of this hybrid promoter was to clone the SphI - SalI fragment containing the GAL_(UAS) into pGS2888. pGS2888 was digested with SphI and SalI, phenol-chloroform extracted and ethanol precipitated. Fifty nanograms of SphI, SalI digested pGS2888 was incubated with 25 ng of the annealed, ligated GAL_(UAS) mixture in 0.005 ml 1× ligase buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated overnight at 4° C. and a portion used to transform E. coli DH5α. Ampicillin resistant clones were isolated and plasmid DNA prepared. The plasmid DNA (digested with XbaI and SpI) was analyzed by agarose gel electrophoresis. A plasmid containing a fragment of the expected size (¹³ 500bp) was identified. The sequence of the putative GAL_(UAS) portion of this plasmid was determined and the plasmid was designated pGS4788 (FIG. 21(b)). The complete sequence of the synthetic GALGAP promotor (PGGAP) is shown in FIG. 20.

CONSTRUCTION OF A pGGAP-β-GLOBIN EXPRESSION CASSETTE

The plasmid pLCIIFX-β-globin (K. Nagai, M. Perutz and C. Payart, Proc Nat Acad Sci (USA) 82:7252-7255) was used as the source of human β-globin cDNA. The coding region of the β-globin cDNA can be excised as an ApaL1 to Hind3 fragment that is missing only the first four nucleotides of the β-globin coding (translated) sequence. pLCIIFX β-globin (5 μg) was digested with ApaL1 and Hind3 and a 550 bp fragment containing the cDNA was purified by acrylamide gel electrophoresis. The ApaL1-Hind3 fragment containing the β-globin cDNA was cloned into Xba1, Hind3 digested pUC19 using the following adaptor (synthesized and purified as described above): ##STR8##

The two oligonucleotides were mixed (12.6 μg of each), NaCl was added to 0.25M and three volumes of ethanol (anhydrous) was added. The precipitated oligonucleotides were collected by centrifugation (14,000× g, 15 min) and the pellet washed twice with 80% ethanol, once with anhydrous ethanol and dried in vacuo. The adaptors were dissolved in 1 ml of 0.1XTE. Ligation reactions were conducted using fifty nanograms of Xbal-HindIII fragments containing the β-globin cDNA and 12.5 ng of the ethanol precipitated (non-phosphorylated) VH1a,b oligonucleotide in a total of 10 μl of T4 DNA ligase buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated at 4° C. for 16 hr. and a portion used to transform E. coli DH5α. Transformants were selected on LB-ampicillin plates containing 4 μg/ml XGAL. Plasmid DNA was prepared from 12 white colonies. The DNA was digested with Nco1 or ApaL1 and analyzed by agarose gel electrophoresis. Four of these colonies contained plasmids with the expected restriction fragments and one was designated pUC19β-globin (FIG. 21(a)). The plasmid pSUC2-6Σ (G. Stetler et al. Biotechnology 7:55-60 (1989)) (FIG. 21(a)) was digested with Hind3 and Xba1 and the large fragment was purified by agarose gel electrophoresis. The Xbal to Hind3 fragment containing β-globin cDNA was also purified (agarose gel electrophoresis) from pUC19β-globin. Ten nanograms of gel-purified Xba1, Hind3 digested pSUC2-6Σ was mixed with 16 nanograms of the Xbal, Hind3 fragment from pUC19β-globin in 0.01 ml of ligase buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated at 25° C. for 1 hr. and a portion used to transform E. coli DH5α. Transformants were selected on LB-ampicillin medium and three were used to prepare plasmid DNA. These were analyzed by digestion with EcoR1 and analyzed by agarose gel electrophoresis. Two of these contained plasmids with the expected restriction fragments and one was designated pGS1188 (FIG. 21(b)). This plasmid contains β-globin under the transcriptional control of the sigma promoter and contains the 3' transcriptional termination signals and polyadenylation signals of the MFα1 gene.

REPLACEMENT OF THE SIGMA PROMOTER WITH pGGAP

The plasmid pGS1188 was digested with SphI and NcoI and the vector plus β-globin cDNA was separated from the sigma promoter by agarose gel electrophoresis and purified as described previously. Plasmid pGS4788 (10 ug) was also digested with SphI and NcoI and the ⁻⁻ 500 bp fragment (containing pGGAP) produced by this digest was purified by agarose gel electrophoresis. Fifty nanograms of the gel-purified β-globin containing vector was incubated with 50 ng of NcoI-SphI fragment containing the pGGAP promotor in 0.01 ml of 1× ligase buffer with 10 units of T4 DNA ligase. The mixture was incubated for 1 hour at 25° C. and a portion of the ligation mixture used to transform E. coli DH5α. Ampicillin resistant clones were selected. Plasmid DNA isolated from 12 of these clones was analyzed by digestion with SphI and NcoI to identify plasmids containing the ⁻⁻ 500 bp GGAP promoter. The presence of the β-globin cDNA was confirmed by digestion with XbaI and SalI, followed by agarose gel electrophoresis analysis. A plasmid containing the expected fragments was identified and designated pGS3588 (FIG. 21(b)). To aid the subcloning of this fragment into a yeast vector, the SmaI site of pGS3588 was converted to a XhoI site as follows: 1 μg of pGS3588 was digested with SmaI, the digest was extracted once with phenol-chloroform and ethanol precipitated. The precipitated DNA was dissolved in T4 DNA ligase buffer, with 100 ng of phosphorylated XhoI linker and 10 units of T4 DNA ligase (final volume of 0.01 ml). The ligation was incubated at room temperature for 2 hr and a portion of the ligation mixture was used to transform E. coli DH5α (excess linkers were not removed prior to transformation). Ampicillin resistant clones were isolated and plasmid DNA prepared. Plasmids containing the additional XhoI site were identified by digestion with XhoI and agarose gel electrophoresis analysis. A plasmid containing the pGGAP-β-globin expression cassette was identified and has been designated pGS3888 (FIG. 21(b)).

CONSTRUCTION OF AN α-GLOBIN EXPRESSION CASSETTE

We obtained a partial length cDNA (pα-MRC) clone from K. Nagai (MRC, Cambridge). To adapt the cDNA encoding α-globin for expression from the pGGAP promotor, two oligonucleotide primers were synthesized (synthesis and purification of oligonucleotides was as described above).

α-1: 5'-GAATTCCATGGTGCTGTCTCCTGCCGACAAGACC-3'.

α-2: 5'-CTGCAGTCGACTTAACGGTATTTGGAGGTCAGCACGGTGCT-3'.

These two oligonucleotides were used as primers for a polymerase chain reaction (PCR) (R. K. Sakai et al., 1985. Science 230:1350-1354) using a Perkin Elmer-Cetus PCR kit and pα-MRC as template. α-globin cDNA (8.3 ug/ml) in 0.018 ml H₂ O was denatured by the addition of 0.005 ml 10M NaOH. The mixture was incubated at 25° C. for 5 min. Denatured DNA was precipitated by the addition of 0.003 ml 3M sodium acetate (pH 5.2) and 0.075 ml anhydrous ethanol. The precipitated DNA was washed twice with 80% ethanol, once with 100% ethanol and dried. The dried pellet was dissolved in: 0.005 ml 20 μm α-1, 0.005 ml 20 μM α-2, 0.010 ml 10× Taq polymerase buffer (Perkin Elmer-Cetus), 0.0005 ml TaqI polymerase (Perkin Elmer-Cetus), 0.663 ml H₂ O, 0.016 ml of all four deoxyribonucleotide-triphosphates (1.25 mM each). TaqI polymerase was added after heating the solution to 94° C. for 1 min. After addition of the enzyme the aqueous solution was overlayed with 0.100 ml paraffin oil. The reaction-mixture was cycled, by hand, 25 times at the following temperatures: 37° C. for 2 min, 68.5° C. for 3 min and 94° C. for 1 min. After the twenty-fifth cycle, the reaction mix was incubated at 37° C. for 2 min and 68.5° C. for 10 min. A portion (0.005 ml) of the reaction mixture was analyzed by agarose gel electrophoresis and revealed a band of the expected size (⁻⁻ 430 bp). The PCR-amplified DNA fragment should contain a 5' extension, that includes an ATG codon embedded in an optimal sequence context for initiation of translation (M. Kozack 1986, Cell 44:283-292). The 3' end should contain an extension that includes a translational terminator and a SalI restriction endonuclease site. The PCR-amplification reaction was phenol-chloroform extracted and ethanol precipitated. The dry pellet was dissolved in 0.05 ml 1 mM Tris-HCl (pH 7.8). A portion of this material (0.025 (0.25 ml, 1.25 ug) was digested with NcoI and SalI and purified by acrylamide gel (5%) electrophoresis. A gel slice containing the fragment was eluted by crushing the gel slice in 0.3 ml 2.5 ammonium acetate (pH 7.4) and incubating at 37° C. for 16 hr. Acrylamide fragments were removed by centrifugation and the DNA precipitated by the addition of 0.75 ml of ethanol. The pellet was collected and dissolved in 0.020 ml 1 mM Tris HCl (pH 7.8) 0.1 mM EDTA. This fragment was cloned into NcoI, SI digested and agarose gel purified pGS3888. Fifty nanograms of NcoI, SalI digested pGS3888 was incubated (2 hr, 25° C.) with 50 ng of gel purified, PCR-amplified NcoI-SalI fragment containing the α-globin cDNA in 0.01 ml ligase buffer with 10 units of T4 DNA ligase. A portion of this reaction mixture was used to transform E. coli DH5αα- and ampicillin resistant clones were selected on LB-ampicillin medium. Plasmid DNA was prepared from 12 independent isolates and digested with Nco1 and SalI. The restriction digests were analyzed by acrylamide gel electrophoresis (5%), all twelve contained a fragment of the expected size. One of these was designated pGS4088 (FIG. 22). The α-globin insert in pGS4088 was completely sequenced to assure that no mutations had been introduced by PCR-amplification.

CONSTRUCTION OF A YEAST EXPRESSION PLASMID THAT CO-EXPRESSES α- AND β-GLOBIN GENES FROM A SINGLE PLASMID

The α-globin and β-globin expression cassettes from the plasmids pGS3888 and pGS4088 were cloned into a single plasmid in a way that allows them to be excised on a single NotI fragment. This NotI fragment was then cloned into the high copy yeast plasmid pC1N to generate a plasmid carrying and expressing both α- and β-globin chains under the control of separate (though identical) promoters. The details are presented below.

INTRODUCTION OF NOTI SITE INTO pSK(+)

The plasmid pSK(+) (Stratagene, Inc.) (FIG. 23(a)) was modified by digesting 100 ng of purified plasmid DNA with KpnI. After digestion was complete, the DNA was ethanol precipitated and the dry pellet dissolved in 0.05 ml T4 DNA polymerase buffer containing 10 units of T4 DNA polymerase. The reaction mixture was incubated at 37° C. for 2 min, Na₃ EDTA (10 mM) was added and the sample heated to 70° C. for 10 min. The digested DNA was precipitated with ethanol and the dry pellet dissolved in 0.01 ml of 10 mM Tris-HCl (pH 7.8) 1 mM EDTA. A portion of this material (20 μg) was dissolved in 0.005 ml of ligase buffer containing 10 units of T4 DNA ligase and 50 ng of phosphorylated NotI linkers. The ligation mixture was incubated at 25° C. for 2 hr and a portion used to transform E. coli DH5α. Ampicillin resistant colonies were selected on LB-ampicillin medium. Plasmid DNA was isolated, digested with Notl and analyzed by acrylamide gel (5%) electrophoresis. A plasmid containing the additional NotI site is expected to generate a new, ⁻⁻ 90 bp fragment. Such a plasmid was identified and designated pSN(+) (FIG. 23(a)).

CLONING THE pGGAP-α-GLOBIN EXPRESSION CASSETTE INTO pSN(+)

pSN(+) was digested with SalI, phenol-chloroform extracted and ethanol precipitated. The precipitated DNA was dissolved (50 ug/ml) in 1 mM Tris-HCl (pH 7.8), 0.1 mM EDTA. pGS4088 was digested with XhoI and the ⁻⁻ 1100 bp fragment containing the α-globin expression cassette was isolated from an agarose gel. The purified fragment was dissolved (20 ug/ml) in 0.1XTE. Twenty-five nanograms of SalI digested pSN(+) was mixed with 50 ng of the gel-purified α-globin fragment in 0.01 ml ligase buffer containing 10 units of T4 DNA ligase. The ligation mixture was incubated for 1.5 hr at 25° C. and a portion used to transform E. coli DH5α. Ampicillin resistant clones were isolated. One hundred transformants were transferred, in a grid pattern, to fresh plates. Replicas of the grid were made on a nitrocellulose filter and prepared for colony hybridization (R. W. Davis et al. Adv Bacterial Genetics, Cold Springs Harbor, N.Y., 1980.). The hybridization probe was oligonucleotide 4 (Table 6). This oligonucleotide (20 pM) was labelled with ³² P-ATP in a 0.02 ml reaction mixture containing 0.050M Tris-HCl (pH 7.6), 0.01M MgCl₂, 0.005M DTT, 0.0001M spermidine, 0.0001M EDTA, 50pM of ³² P-ATP and 10 units of T4 polynucleotide kinase. The reaction mixture was incubated for 2hr at 37° C., unincorporated ATP was removed by spin-column chromatography (BIORAD) used according to the manufactures instructions. The filters were incubated (37° C.) in hybridization solution (6 XSSC, 50% formamide, 2% SDS, 20 pMoles of ³² P labelled probe) for 16 hr. The filters were washed 4 times (15 min each) with 1× SSC, 0.1% SDS at 55° C.; twice (5 min each) with 0.2 XSSC, 0.1% SDS at 50° C. and twice (5 min each) with 0.2XSSC. Autoradiographs were made of the filters. Colonies that produced a hybridization signal were used to prepare plasmid DNA. This DNA was digested with EcoR1 and the fragments produced by this digest were analyzed by agarose gel electrophoresis. A plasmid containing a fragment of the correct size was identified and designated pGS4888 (FIG. 23(a)).

CLONING THE pGGAP-β-GLOBIN EXPRESSION CASSETTE INTO pGS4888

The construction of pGS4888 required cloning an XhoI fragment (pGGAP-α-globin) into a SalI site. The combination of a XhoI site with a SalI site destroys the recognition sites for both XhoI and SalI restriction endonucleases, leaving pGS4888 with a single unique XhoI site. An XhoI fragment from pGS3888 was purified by agarose gel electrophoresis and cloned into XhoI digested pGS4888 essentially as described for the construction of pGS4888. Colony filter hybridizations were performed using the NcoI-SalI fragment (containing β-globin cDNA) from pGS3888 as a hybridization probe. This fragment was purified by agarose gel electrophoresis and radioactively labelled with α³² P-dCTP using an Amersham random primed oligonucleotide labelling kit. Filters were hybridized, washed and autoradiographs made as described for the construction of pGS4888. Plasmid DNA was prepared from colonies producing a hybridization signal and digested with either NcoI or NcoI and SphI. These digests allow identification of plasmids that contain both the α- and β-globin expression cassettes and reveals the relative orientation of the α- and β-globin transcriptional units. A plasmid containing both transcriptional units was identified and given the designation pGS189 (FIG. 23(b)).

CONSTRUCTION OF PLASMID pC1N

The plasmid pC1U was modified to introduce a NotI site. pC1U DNA (100 ug) was digested with SalI, phenol-chloroform extracted and ethanol precipitated. The SalI ends were modified with T4 DNA polymerase to produce "blunt" ends and phosphorylated NotI linkers were added. The procedures used for these modifications are essentially the same as those used to produce pSN(+) (described above). The plasmid resulting from these manipulations is called pCIN (FIG. 23(b)). This pC1N was digested with Notl, phenol-chloroform extracted and ethanol precipitated. A - 2.4 kb NotI fragment (carrying α- and β-globin expression cassettes) was purified from NotI digested pGS189 DNA by agarose gel electrophoresis. Fifty nanograms of NotI digested pC1N was mixed with 50 ng of the gel purified, 2.4 kb fragment isolated from pGS189 in 0.01 ml of ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 25° C. for 2hr and a portion used to transform E. coli DH5α. Ampicillin resistant clones were selected on LB-ampicillin plates and plasmid DNA prepared. The purified DNA was digested with EcoR1 and analyzed by agarose gel electrophoresis to identify plasmids carrying the α- and β-globin genes and to determine the orientation of the insert with respect to the vector. Two plasmids were identified, representing the two possible orientations and have been designated pGS289 (FIG. 23(b)) and pGS389 (FIG. 23(b)).

EXPRESSION OF RECOMBINANT HUMAN HEMOGLOBIN IN SACCHAROMYCES CEREVISIAE

S. cerevisiae strains GSY112 (MATαpep4::H1S3prb 1 1.6 R his3 200 ura3-52 leu2::hisG can1cir°) and RSY334 (MATA- reg1-501 pep4-prb1-1122 ura3-52 leu2-3, leu2-112) were transformed with plasmids pGS289 or pGS389 by the method of Ito et al. (J. Bacteriology 153:163-168 (1983)). Transformants were selected on yeast SD-ura. Single colony isolates were picked and streaked to SD medium lacking uracil and leucine. Colonies from this selective medium (SD-ura,-leu) were used to inoculate 2 ml of SD-ura,-leu. The cultures were incubated at 30° C. for 24hr and used to inoculate 20 ml cultures of YP +3% ethanol. These were incubated for an additional 24hr and galactose was added to a concentration of 2%. Samples were removed at 4, 8, 24, 28, 32 and 48 hours post induction. Cells were collected by centrifugation and washed with 1 ml 10 mM Tris HCl (pH 7.8), 1 mM EDTA and resuspended in SDS-PAGE sample buffer (2×10⁸ cells/ml). Samples were boiled for 10 min and the insoluble material removed by centrifugation. Samples 0.005 ml were analyzed by SDS-PAGE (12.5% gel) followed by transfer to nitrocellulose. α- and β-globin chains were stained using commercially available rabbit anti-human hemoglobin and an immunoblotting kit purchased from Promega. The protocols used were supplied by Promega. The immunoblot indicated that both chains are synthesized and that pGS289 produces slightly less material than pGS389. An apparent lack of stoichiometry between the α-chain (lower band) and β-chain is due to a difference in immunoreactivity of the antibody to the two chains. This was demonstrated by comparing Commassie brilliant blue stained gels of purified human and recombinant hemoglobin with immunoblotted samples.

EXAMPLE 20 Characterization Of Human Recombinant Hemoglobin Synthesized in Yeast

RSY334 pGS389! was grown in 100 ml SD-leucine to an OD₆₀₀ of 2.4. This was used to inoculate 1 L of YPE. This culture was shaken at 30° C., for 24hr at which time 50 ml of 40% galactose was added. The incubation was continued and the cells harvested 24 hr later (OD₆₀₀ of 9). The cell pellet was resuspended in 100 ml of 0.01M Tris-HCl (pH 7.18), 0.001M EDTA and carbon monoxide bubbled through the cell suspension for 3 min. The cells were collected by centrifugation and after the CO treatment were distinctly red. The cell pellet (19 gm) was resuspended in 19 ml of lysis buffer (0.01M NaPO₄ pH 6.0, 0.020M DTT, 1% Triton×100, 0.001M PMSF, 0.005M benzamidine and 0.06 mM leupeptin) and bubbled with carbon monoxide for 2 min. The cell suspension was sonicated with Branson 250 sonicator equipped with a 0.5 inch disrupter horn. The sonication time was 2 min (0.5 sec pulses) at full power. This was repeated 4 times with 2 min cool-down periods between sonication intervals. The cell debris was removed by centrifugation (27,000× g for 15 min) and the supernatant saved (0° C.). The pellet was resuspended in 5 ml of lysis buffer. The resuspended pellet was sonicated with the microtip as described above (70% of maximum output). The cell debris was removed by centrifugation as described above and this supernatant combined with the first. The combined solutions were clarified by centrifugation (38,000× g, 20 min), producing a clear, red solution. This was loaded (after adjusting the pH to 6.0, with 10 mM phosphoric acid) onto a 5 ml S-Sepharose fast flow column equilibrated with 0.01M sodium phosphate (pH 6.0). All of the red material bound to the column and the column was washed with 20 ml of 0.01M sodium phosphate (pH 6.0). The column was eluted with 0.05M sodium phosphate (pH 7.5), 0.1M NaCl and 1 ml fractions were collected. The red color eluted in two fractions. The purity of this material was analyzed by SDS-PAGE and appears to be ≧50% pure after the first chromatography step. This material was dialyzed against 10 mM sodium phosphate (pH 6.0), 0.001M EDTA for 16 hr at 0° C. and re-chromatographed on a mono-S FPLC column (ph 6.8-pH 9.0, 0.02M sodium phosphate gradient). The peak fraction from this is ≧85% pure. An absorbtion spectrum was obtained by scanning the mono-S purified material from 400 mM to 650 mM with a Shimadzu spectrophotometer (FIG. 24, top). The spectrum obtained was identical to that of human hemoglobin (FIG. 24, bottom), indicating that the protein had folded and incorporated heme. The amount of hemoglobin recovered in the two peak fractions was determined from the extinction coefficient (1.23×10⁴ A450M) to be ¹³ 20 mg.

EXAMPLE 21 Construction of Vectors for the Expression of α-and β-Globin from Separate Yeast Plasmids

In addition to the development of a single yeast vector that carries both α- and β-globin expression cassettes we also developed a system that uses separate plasmids for each of the two globin cDNA's. The two plasmids each carry two yeast genes that are used to maintain the plasmid in yeast. Both have the LEU2 gene in common and one (pGS4688) has the URA3 gene, the other (pGS4988) has the TRP1 gene. By using a host that carries mutations in URA3, LEU2 and TRP1 both plasmids (one with α-globin and the other with the β-globin expression cassette) can be maintained. The constructions of these vectors is described below.

CONSTRUCTION OF pC1T

The plasmid pC1U (5 ug) was digested with BamHI and SalI and the largest fragment was purified by agarose gel electrophoresis. The plasmid YRp7 (10 ug) (J. Strathern, E. Jones, and J. Broach, The Molecular Biology of the Yeast Saccharomyces, Cold Spring Harbor, N.Y., 1981) was digested with BglII and SalI and the fragment containing the TRPI gene was purified by agarose gel electrophoresis. Twenty-five ng of gel-purified, BamHI and SalI digested pC1U was mixed with 50 ng of the BglII-SalI fragment in ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 25° C. for 1.5 hr and a portion of the ligation reaction mixture used to transform E. coli DH5α. Tetracycline resistant colonies were selected on LB-tetracycline medium. Plasmid DNA was prepared from 15 transformants, digested with EcoR1 and analyzed by agarose gel electrophoresis. One isolate with the expected EcoR1 restriction fragments was chosen and was designated pC1T (FIG. 22(a)).

CLONING THE β-GLOBIN EXPRESSION CASSETTE IN pC1T

One hundred nanograms of pC1T was digested with SalI, phenol extracted, ethanol precipitated and dissolved in 0.01 ml of 10M Tris-HCl (pH 7.8), 1 mM EDTA. The plasmid pGS3888 was digested with XhoI and the ¹³ 1.2 kb XhoI fragment containing the β-globin expression cassette was purified by agarose gel electrophoresis. Ten nanograms of SalI digested pC1T was mixed with 60 ng of the XhoI fragment containing the β-globin expression cassette in 0.01 ml of ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 25° C. for 30 min. A portion of this material was used to transform E. coli DH5α. One hundred ampicillin resistant transformants were picked and patched to LB-ampicillin agar. They were incubated for 5 hr at 37° C. and overlayed with a nitrocellulose filter. Plasmids containing the XhoI fragment were identified by colony hybridization, using the XhoI fragment as a hybridization probe, as described above. Colonies producing an autoradiographic signal were used to prepare plasmid DNA. The purified DNA was digested with EcoR1 and analyzed by agarose gel electrophoresis for the presence of the expected restriction fragments. Two clones containing the desired XhoI insert were identified and the orientation of the insert was determined by agarose gel analysis of EcoR1 digests or SphI digest of the plasmid DNA. Both isolates contained the desired insert in the same orientation and one was designated pGS4988 (FIG. 22(a)).

CLONING α-GLOBIN EXPRESSION CASSETTE IN pC1U

The Xhol fragment from pGS4088 containing the α-globin expression cassette was purified by agarose gel electrophoresis and the recovered fragment dissolved (100 ng/0.01 ml) in 1 mM Tris-HCl pH 7.8, 0.1 mM EDTA. Twenty-five nanograms of the gel purified fragment was mixed with long of SalI digested (phenol-chloroform extracted, ethanol precipitated) pC1U in 0.01 ml ligase buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 25° C. for 1.5 hr and a portion used to transform E. coli DH5α. Ampicillin resistant colonies were selected on LB-ampicillin plates. Plasmid DNA was prepared from 12 transformants, digested with HindIII and analyzed by agarose gel electrophoresis. The two possible orientations were isolated and given the designation pGS4488 (FIG. 22(b)) and pGS4688 (FIG. 22(b)).

CONSTRUCTION OF DIPLOID STRAINS EXPRESSING α- AND β-GLOBIN CHAINS FROM SEPARATE PLASMIDS

S. cerevisiae RSY330 (MATα pep4-3 prb1-112 hist7 ura3-52 trp1-289 can1 gal1) was transformed (Ito et al.) with pGS4488 or pGS4688. Trp⁺ transformants were selected on SD-trp medium and streaked for single colonies. S. cerevisiae BJY1991 (MATα prb1-112 pep4-3 leu2-3,112 trp1- 101 ura3-52 gal2 can1) was transformed with pGS4988. URA⁺ transformants were selected on SD-ura medium and streaked for single colonies. These strains were each tested for the production of α- and β-globin as follows: (1) Single colonies were picked from SD-selective medium and used to inoculate 2 ml of SD-selective (liquid) medium. The cultures were incubated for 24 hr at 30° C. and diluted into 25 ml of fresh SD-selective medium and incubated for an additional 24 hr. The cells were collected by centrifugation and resuspended in 25 ml of YP-galactose (2%) and the incubation continued for an additional 24 hr. The cells were harvested (8,000× g, 10 min) and the pellet washed with 50 ml 0.010M TrisHCl(pH 7.8), 0.001M EDTA. The pellets were dissolved by heating to 96° C. for 10 min in SDS-PAGE sample buffer and the debris removed by centrifugation (15,000× g, 10 min). The cleared supernatants from 1×10⁶ cells each were analyzed by SDS-polyacrylamide gel electrophoresis and Western immunoblotting as described above. β-globin was readily detectable in extracts from BJY3505 pGS4988!. We were also able to detect α-globin cross-reacting material, although the signal strength was considerably weaker. The production of tetrameric hemoglobin requires the presence of both α and β chains, ideally these would be expressed in the same cell. Because strains BJY3505, BJY1991 and RSY330 are haploids they each can be mated with a yeast strain of the opposite mating type. Strains RSY330 and BJY1991 are both mating type a, whereas BJY3505 is mating type a. BJY3505 pGS4988!, RSY330 4688! or BJY1991 4688! or RSY330 pGS4688! matings were done and diploids selected by streaking onto SD minimal medium with no additional amino acids or other nutrients. Neither of the plasmid-bearing parental strains are capable of growth on this medium, diploids, however, can grow. Because the diploids are homozygous for the mutations in TRP1 and URA3 both plasmids must be present for the cells to grow in the absence of these nutrients. These diploid strains were analyzed for the synthesis of α and β-globin as described above. A most surprising result was obtained. Although α-globin and β-globin are synthesized at low levels in the haploid strains, co-expression in a diploid strain results in a substantial increase in the levels of both chains. Furthermore, after induction (24 hr) with galactose the cell pellets develop a distinct, pink-red color. These results suggest that: (a) co-expression of α- and β-globin stabilizes the two proteins, perhaps as a consequence of their interaction and (b) the protein is apparently folding and incorporating heme.

EXAMPLE 22 Oxygen Binding Properties of Yeast-Derived Hemoglobin

The P₅₀ of the yeast derived hemoglobin is nearly identical to that of Hgb Ao. Depending on how we measure it, the P₅₀ ranges from about 5 to 10 torr in a phosphate free solution. At 25° C. it is about 4-6 torr in 50 mM BisTris, pH 7.4, NaC1 0.1M. In the same solution, at 37° C., the P₅₀ is from 8.5 to 11.

EXAMPLE 23 Expression of Di-Alpha Hemoglobin in S. Cerevisiae

Methods

Unless stated otherwise, all enzymes (restriction endonucleases, T4 DNA ligase, T4 DNA polymerase, T4 polynucleotide kinase) were purchased from New England Biolabs, Pharmacia, BRL, Stratagene or Boerhinger Mannheim. Restriction enzymes and T4 DNA ligase were used with the buffers supplied by the manufacturers.

Ethanol precipitation of nucleic acids was carried out by the addition of 0.5 volumes or 7.5M ammonium acetate and 2 volumes of 20:1 ethanol-isopropanol. The pellet was collected by centrifugation at 14,000× g for 15 minutes, washed twice with 80% ethanol, once with 95% ethanol and dried in vacuo. Phenol extractions were done by the addition of 50:49:1 mixture of phenol:chloroform:isoamyl alcohol. Phases were separated by centrifugation at 14,000× g and the aqueous phase collected. Plasmid DNA was purified from E. coli DH5α as described by Birnborn and Doly (Nucleic Acids Research 1979, 7:1513-1520). Electrophoretic analysis of DNA was carried out in agarose gels using tris-acetate electrophoresis buffer (Maniatis, et al. Molecular Cloning, Cold Springs Harbor, N.Y., 1982). DNA was visualized by staining the gels with 0.5μg/ml ethidium bromide and exposing the gels to ultraviolet light. DNA fragments were purified from agarose gels using a kit purchased from BIO-101. DNA fragments were purified from acrylamide gels by crushing the excised gel fragment in 3.25M ammonium acetate and incubating overnight at 37° C. Gel fragments are removed by centrifugation (14,000× g, 15min) and the DNA precipitated with ethanol. The precipitate is dissolved in TE (10 mM Tris HC1, pH 7.8, 1 mM Na₃ EDTA). Acrylamide gel electrophoresis of DNA was done as described by Maniatis, et al. (Molecular Cloning, Cold Spring Harbor Laboratory, N.Y., 1982). Bacteriological growth mediα and DNA transformation methods are as described by R. W. Davis, et al. (Advanced Bacterial Genetics, Cold Spring Harbor Laboratory, N.Y., 1980). Mediα for the growth of S. cerevisiae has been described by F. Sherman et al. (Methods in Yeast Genetics: A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1979). Transformation of S. cerevisiae with linear or circular DNA was carried out as described by H. Ito, et al. (J. Bacteriology, 153:163-168(1983)).

Removal of the PstI and SpeI Sites from pGS4888

The design of the synthetic linker for joining two α-globin chains allows the inclusion of PstI and SpeI sites flanking a 30 bp sequence that includes the junction of the two α-globin coding sequences. Because we anticipate testing several different linker sequences, these sites will allow directional cloning of relatively short synthetic oligonucleotides encoding different linker sequences. Removal of the PstI and SpeI sites from the vector sequence is, therefore, necessary so that the sites in the coding region are usable. One μg of the plasmid pGS4888 was digested with PstI and ethanol precipitated. The dry pellet was resuspended in 50 μl of 33 mM Tris-acetate, pH 7.9, 66 mM potassium acetate, 10 mM magnesium acetate, 0.5 mM DTT and 50 μM of each dNTP (T4 polymerase buffer). Two units of T4 DNA polymerase were added and the reaction mixture incubated for 15 min at 37° C. Na₃ EDTA was added to 12.5 mM and the reaction mixture heated to 65° C. for 15min, phenol extracted and ethanol precipitated. The dry pellet was dissolved in 14 μl of T4 DNA ligase buffer (BRL) and 1 μl (10 units) of DNA ligase added. The ligation mixture was incubated a 4° C. for 16hr. A portion of the ligation reaction was used to transform E. coli DH5α and transformants were selected on LB-ampicillin plates. Plasmid DNA was prepared from 12 transformants. The DNA was analyzed by agarose gel electrophoresis of PstI digests. Five transformants had lost the PstI site and one of these was designated pGS1889. The SpeI site of this plasmid was removed as described above after digestion of pGS1889 with SpeI. A plasmid was identified that had lost both the PstI and the SpeI site and was designated pGS1989.

Strategy for Joining Two Copies of the α-Globin cDNA

A fragment containing the 5'-363 bp of the α-globin coding region can be excised from pGS4888 as an NcoI to ApaL1 fragment. A second fragment containing nucleotides 109 through the 3' untranslated region can be removed as a FokI to SalI fragment. These two fragments can then be joined together to create a single translation unit encoding two tandem copies the of α-chain cDNA by using a synthetic oligonucleotide adaptor (Table 8). The purification and assembly of these fragments is described below.

The plasmid pGS4888 was sequentially digested with ApaL1 and NcoI and a ⁻⁻ 365 bp fragment (α-chain 1) purified by acrylamide gel electrophoresis. A second fragment of ⁻⁻ 351 bp (α-chain 2) was prepared by sequential digest with FokI and SalI followed by acrylamide gel electrophoresis. Four oligonucleotides were synthesized (Table 8) and assembled into a ⁻⁻ 173 bp linker (RGGV) containing ApaL1 and FokI ends. Synthesis, purification and assembly of oligonucleotides was as described previously. This adaptor encodes an amino acid bridge linking the carboxy-terminus of α-chain1 with the amino-terminus of α-chain2 as well as portions of the 3' end of α-chain1 and the 5' end of α-chain2. The carboxy terminal arginine residue of α-chain1 is separated from the amino terminal valine of α-chain2 in this construct by 2 glycine residues. The "diglycine bridge" portion is flanked by HpaI, SpeI and PstI sites. These sites allow the substitution of a variety of bridges by the use of ⁻⁻ 30 bp adaptors to connect the two α-chains. The assembly of the two α-chain genes into an expression cassette was carried out in a four part ligation as described below.

Plasmid pGS1989 was sequentially digested with NcoI and SalI and the large fragment containing the plasmid vector and pGALGAP was purified by agarose gel electrophoresis (gp1989). Fifty nanograms of gp1989 were mixed with 200 ng each of the gel purified ApaL1-NcoI α-chain1 fragment and the FokI-SalI α-chain2 fragment. A twenty-fold molar excess of the synthetic ApaL1-FokI adaptor was added (the 5'-ends of the adaptor segment are not phosphorylated). The ligation reaction was carried out in a volume of 20 μl for 4 hr at 23° C. A portion of this reaction mixture was used to transform E. coli DH5α and ampicillin resistant colonies were selected. Transformants were patched to nitrocellulose filters and screened by hybridization with the ³² P-labelled oligonucleotide AL2as (Table 8). Five pMoles of oligo AL2-as were incubated in 20 μl of a solution containing 2 units of T4 polynucleotide kinase, 50 mM Tris-HC1 (pH 7.6), 10 mM MaC1.6 mM DTT. 0.1 mM spermidine. 0.1 mM EDTA, and 10 pMoles of c³² P-ATP (7000 Ci/mMole) for 2 hr at 37° C. Filters were processed as described by Maniatis, et al. and the hybridization was done at 37° C. in 5×SSC, 50% formamide, 100 μg/ml yeast tRNA and 2% SDS for 16 hr. The filters were washed, sequentially, in 2×SSC and 1×SSC at 55° C. for 15 min (twice in each, all wash solutions contained 1% SDS). Dried filters were exposed to X-ray film and colonies giving a hybridization signal were used to prepare plasmid DNA. Plasmid DNA was analyzed by restriction enzyme digestion to identify plasmids that contained inserts of a size consistent with two α-chains (NcoI-SalI digest) and that contained unique PstI, SpeI and HpaI sites. A plasmid identified in this manner was designated pGS2189 (FIG. 25).

An XhoI fragment from pGS3888 containing the β-globin expression cassette was purified by agarose gel electrophoresis. XhoI digested pGS2189 (50 ng) was combined with 150 ng of the gel purified insert from pGS3888 in 10 p1 of ligation buffer containing 10 units of T4 DNA ligase. A portion of this mixture was used to transform E. coli DH5α and ampicillin resistant colonies were selected. Plasmid DNA was isolated and analyzed by digestion with XhoI, BamHI or NcoI. Several plasmids were identified that produced restriction fragments of the expected sizes, all contained inserts in the orientation shown in FIG. 25. One of these was designated pGS2989. Although this plasmid contains the linked α-globin genes and a β-globin gene under the control of separate promoters, it is not capable of replication in S. cerevisiae. The entire expression cassette, containing the two genes (diα and β-globin), can be purified as a NotI fragment. Ten μg of pGS2989 was digested with PvuI and NotI and the NotI fragment gel purified. The digestion with PvuI was done to reduce the size of the vector sequences which otherwise would comigrate with the desired NotI fragment. Two hundred ng of the gel purified NotI fragment was combined with 50 ng of NotI digested pC1N in 10 μl of ligation buffer containing 10 units of T4 DNA ligase. The reaction mixture was incubated at 4° C. overnight and a portion was used to transform E. coli DH5α. Ampicillin resistant transformants were selected and plasmid DNA prepared. DNA was digested with NcoI-SalI, PstI or NotI to identify plasmids with the diα, β-globin expression cassette and to determine the orientation of the inserted fragment. Several plasmids were identified that contained the correct insert, all of which have the inserted fragment in the same orientation. One of these was designated pGS3089 (FIG. 25). This plasmid was used to transform strains GSY112 and RSY334.

Expression and Purification

S. cerevisiae strains GSY112 pGS3089! and RSY334 pGS3089! were grown to saturation in SD-uracil medium and diluted into 2 L of YPD medium (all cultures were incubated at 30° C.). Twenty-four hours after inoculation (of the YPD culture), galactose was added to 1% and the cultures incubated for another 24 hr. Carbon monoxide was bubbled through the culture and the cells collected by centrifugation. A 1:1 mixture (weight:vol) of cells and breakage buffer (10 mM sodium phosphate (pH 7.2), 1 mM EDTA, 1 mM EGTA and 5 mM DTT) were disrupted in a "Bead-Beater" (Biospec Products, Bartlesville, Okla.). Debris was removed by centrifugation (10,000× g, 30 min). The soluble fraction was adjusted to pH 6.0 with phosphoric acid and chromatographed on a column of S-sepharose (fast flow) equilibrated in 10 mM sodium phosphate pH 6.0. The loaded column was washed with 10 mM Tris-HC1, pH 6.7 and hemoglobin eluted by washing with 20 mM Tris-HC1, pH 7.5. A bright red band was collected and the pH adjusted to 8.0, by the addition of NaOH. This material was then chromatographed on Q-sepharose (fast flow) equilibrated in 20 mM Tris-HCl, pH 8.0. Hemoglobin was eluted with a NaCl gradient (0-0.4M). A final chromatography step was carried out on Sephacryl S-200 equilibrated in 5 mM NaPO₄, pH 7.4, 0.1M NaCl. Each step of the purification protocol was analyzed by SDS-polyacrylamide gel electrophoresis (Laemmli, U.K., 1970, Nature 227:680-685) and staining with Commassie brilliant blue. The purified protein contains a band that comigrates with monomer β-globin and a band in the expected position for α-globin dimer. This material comigrates with human tetrameric hemoglobin when analyzed by size exclusion chromatography (Progel TSK G3000 SWXL HPLC column). This protein is red and binds 02 with a 50% binding affinity (P50) of 8-10 Torr, indicating that it has incorporated heme and is capable of reversible 02 binding.

The purified protein was separated into α- and β-chains by reverse phase HPLC and the sequence of the 10 amino-terminal residues of each chain were determined. The sequence matched that of bona fide human hemoglobin, indicating that the initiating methionine had been efficiently removed from both the α-globin dimer and the β-globin dimer.

EXAMPLE 24 Construction of Low Affinity, Genetically Cross-Linked, Hemoglobin Mutants and Expression in Yeast

24.1 Construction of Vector for Site Directed Mutagenesis

The XhoI fragment containing the β-globin gene and its GALGAP promoter was isolated from pGS2989 by preparative agarose gelelectrophoresis and ligated with XhoI digested Phagescript (RF-form, obtained from Stratagene, Inc.). E. coli XL1-Blue was transformed with the DNA ligation mixture and phage containing inserts were identified (white plaques on medium containing XGAL). Single plaques were isolated and DNA was prepared and analyzed by digestion with XhoI and agarose gel electrophoresis. This construct was designated M13β-globin.

24.2 Preparation of Single Stranded Template

A saturated culture of E. coli XL1-Blue (available commercially from Stratagene, 11099 North Torrey Pines Road, LaJolla, Calif. 92037) (200 μl) was used to inoculate 4 ml of 2× YT broth. This culture was incubated for 2 hr. at 37° C. and then infected with a single M133-globin phage plaque and the incubation continued for 6-8 hrs. Cells were removed by centrifugation and discarded. Phage were precipitated from 1.6 ml of clarified medium by the addition of 320 μl of cold 30% PEG 8000 in 3.5M ammonium acetate followed by incubation on ice for 30 min. Phage were collected by centrifugation and resuspended in 0.1 ml of 10 mM Tris-HCl pH 8.0, 1 mM EDTA (TE). DNA was isolated by extracting twice with phenol/cholorform. DNA (contained in the aqueous phase) was precipitated by the addition of NaCl to 0.5M and two volumes of 95% ethanol. The DNA pellet is dissolved in 20 μl of water.

24.3 In Vitro Mutagenesis Reactions

Two hundred ng of template DNA are mixed with a twenty-fold molar excess of the appropriate, phosphorylated mutagenic oligonucleotide in 10 μl of 20 mM Tris-HCl, pH 7.4, 2 mM MgCl₂, 50 mM NaCl and heated to 70° C. for 5 minutes. The annealing reaction is allowed to slowly cool (40 min) to 40° C. and then to 30° C. (15 min). After the last annealing step the mixture is transferred to ice for 5 min. To this mixture 1 μl of synthesis buffer (4 mM each dNTP, 7.5 mM ATP, 175 mM Tris-HCl pH 7.4, 37.5 mM MgCl₂ 215 mM DTT), 0.5 μl of T4 gene32 protein (2 μg/μl), 1 μl of T4 DNA ligase (3 units), and 1 ml of T4 DNA polymerase (1 unit) are added. The mixture is incubated at 37° C. for 90 min (following 5 min at room temperature). The reaction is stopped by the addition of 90 ml of 0.1M Tris-HCl (pH 8.0) and 0.1M EDTA. Approximately 0.1 μl of the reaction mix was used to transfect E. coli XL1-Blue cells (Cells were prepared for transformation by the method of D. Hanahan, 1983. J. Mol. Biol. 166:557).

24.4 Screening for Phage Containing the Mutant Gene

Approximately 50-100 plaques were picked to fresh plates seeded with the appropriate host strain (XL1-Blue) in ordered arrays. After incubation for 6-8 hrs. at 37° C. the plates were overlaid. with nitrocellulose filters and prepared for hybridization essentially as described in Davis, R. W. et al. (Advanced Bacterial Genetics: A Manual of Genetic Engineering. Cold Spring Harbor Laboratory, New York 1980). The choice of hybridization temperature with the mutagenic oligonucleotide was determined on the basis of the nucleotide composition of the oligonucleotide. The oligonucleotide was labelled with C³² P-ATP and polynucleotide kinase (T. Maniatis et al. Molecular Cloning: A Laboratory Manual. Cold Spring Harbor Laboratory, 1982). Hybridizations were done at 2-5° C. below the calculated Tm for the correct match in 6×SSC, 2%SDS, 100 μg/ml yeast tRNA using 10⁵ cpm/ml of the labelled oligonucleotide for >6 hr. The Tm was calculated using the formula: 4° C. for every GC pair +2° C. for each AT pair assuming an Na+ concentration of 1M. Filters were washed at the same temperature as the hybridization in 5×SSC, 2% SDS and exposed to XRay film. Plaques giving positive hybridization signals were used to prepare single stranded DNA, as described above. The single-stranded DNA was used as template for sequencing reactions (Sanger, F. and Coulson, A. R. 1975, J. Mol. Biol. 94:441) to confirm that the mutant sequence was indeed present.

Oligonucleotides used for Mutagenesis

βN108K-"Presbyterian" 5'-AGGCTCCTGGGCAAGGTGCTGGTCTGT-3'

βE9OK-"Agenogi" 5'-GCCACACTGAGTAAGCTGCACTGTGAC-3'

βV67I-(No Alias) 5'-CATGGCAAGAAAATCCTCGGTGCCTTT-3'

β102T-"Kansas" 5'-GTGGATCCTGAGACTTTCAGGCTCCTG-3'

24.5 Cloning into Yeast Expression Vectors

Phage RF DNA was prepared from phage-infected cells that had been confirmed to have the mutant sequence and the XhoI fragment containing the altered β-globin gene was purified by agarase gel electrophoresis. This fragment was cloned into a derivative of pGS3089 that had been altered to change the di-α-crosslink from the diGly configuration to a single glycine bridge, and from which the β-globin gene had been deleted, and designated pGS3889RGVdesβ (FIG. 26). This created a unique XhoI site into which altered β-globin expression cassettes could be inserted, allowing coexpression with the α-globin dimer. The expression plasmids generated in this way (pGS3889 for "Presbyterian", pGS5189 for "Agenogi", pGS5689 for "Kansas" and pGS4989 for "βV67I", all identical to pGS3089 except for mutation in specified codon) were used to transform S. cerevisiae strains GSY112 and RSY334.

24.6 Characteristics of a Genetically Crosslinked Low Affinity Mutant Protein Expressed in Yeast

Cells bearing the plasmid pGS3889 (single gly bridge, BN108K alias "Presbyterian") were grown and hemoglobin purified as previously described. This material when analyzed for functionality was substantially "right-shifted" compared to the crosslinked protein with a wild type β-chain (P₅₀ =23-25 for the mutant, with N=2.5).

EXAMPLE 25 Effect of Choice of Strain and Induction Temperature on Expression of Di-Alpha Hemoglobin in E. Coli

Table 100 contains comparisons of di-alpha/beta fermentations. It also provides comparison of temperatures of induction. The column labeled "mg di-A+B" is total mg di-alpha and beta polypeptides per fermentation. The adjacent column "mg/OD-L" simply expresses the first column number on a cell density basis. The two columns labeled RHGB present total and cell density-corrected output of functional recombinant hemoglobin. The last column shows that in terms of final functional hemoglobin recovery, strain JM109 is preferable. Without binding ourselves to any theory, we believe that this difference has to do with the proteases expressed in different strains. It is interesting to note that of the two best JM109 runs, one induction was at 30° and one at 37° with roughly equivalent amounts of final functional Hb produced.

EXAMPLE 26 Construction of Genetically Cross-Linked α-Globin Dimers Connected By a Single Glycine or Proline Residue

The following synthetic adaptors for altering the diα-globin bridge were synthesized and purified as described in previous sections. ##STR9##

Complementary pairs of RPV and RGV oligonucleotides were combined (2.4 μg of each oligonucleotide) in 0.05 ml of water. The two pairs of adaptors were precipitated (separately) by the addition of 2.0 μl of 4M NaCl and 0.158 mL of 100% ethanol followed by centrifugation. The pellets were washed with 80% and 100% ethanol and dried. The adaptors were dissolved in 24 μl of TE.

26.1 Cloning of the RGV and RPV Adaptors

The plasmid of pGS2989 was sequentially digested with the enzymes HpaI and PstI and the vector was purified after agarose gel electrophoresis. The digested plasmid was then ligated with a 10-fold molar excess of either the RGV or RPV adaptor (these adaptors were not phosphorylated) as previously described. A portion of the ligation mixture was used to transform E. coli DH5. Transformants were selected on LB plates containing ampicillin. Clones containing the new adaptor were identified by colony filter hybridization. The hybridization probes were either the RPV or RGV upper strands (as shown above). These proves were labelled with c32P-ATP and T4 polynucleotide kinase. Filters were hybridized with the appropriate probes (10⁵ cpm/ml) at 37° C. in a solution containing 6×SSC. 50% formamide, 2% SDS and 150 μg/ml yeast tRNA. Filters were incubated for >12 hrs. and washed 4 times with 2×SSC, 2% SDS (250 ml, 20 min each) and exposed to X-ray film. Colonies that produced autoradiographic signals were used to prepare plasmid DNA which was sequenced using the following primer: 5'-AAGCTTCAGCACCGTATTCA-3' (α² seql). This primer was specifically designed to minimize homology with the corresponding sequence in the al subunit of the dimer and to maximize homology with sequences near the 5' end of the α2 subunit of the diα dimer. This allows sequence to be determined reading from the a region through the sequence that bridges the two α-domains without a background sequence reading from a similar sequence near the 5'-end of the al-domain. The plasmids constructed in this way were designated pGS2989RPV(single proline crosslink) or pGS2989RGV (single glycine crosslink).

26.2 Cloning into Yeast Expression Plasmids

The NotI fragments from either pGS2989RPV or pGS2989RGV that contain the hemoglobin expression cassette were purified by agarose gel electrophoresis and subcloned into pC1N that had been digested with NotI. Plasmid DNA from ampicillin resistant transformants was isolated and analyzed by digestion with NotI and agarose gel electrophoresis. These plasmids were designated pGS3089RPV or pGS3089RGV. A second set of plasmids was generated from these to facilitate the substitution of different β-chain mutants. These plasmids were generated by digestion with XhoI and religation under dilute conditions <1 μg/ml). This favors deletion of the β-chain expression cassette. Plasmid DNA was isolated from ampicillin resistant transformants and the structures confirmed by digestion with NotI and XhoI (these plasmids have been designated pGS3089RGV-desβ, see FIG. 26, and pGS3089RPV-desβ).

EXAMPLE 27 Comparison of Renal Toxicity of Di-Alpha Hemoglobin and Des-Val Hemoglobin

Safety studies were conducted in both rats and dogs. Two rats received non-stabilized rHb and two rats received stabilized rHb (a di-alpha Hb with a di-glycine linker between the alpha globin domains) The following serum chemistry parameters were evaluated:

CK creatinine kinase

ALT alanine aminotransferase

AST aspartate aminotransferase

BUN blood urea nitrogen.

A substantial increase in these parameters, relative to baseline, 24 hours after administration, is indicative of renal toxicity: The following values were obtained:

    ______________________________________                                                          Non-                                                                           Stabilized                                                                              Stabilized                                           TEST             rHb      rHb                                                  ______________________________________                                         BUN baseline     19.5     17.5                                                 BUN 24 hours     109.5    26.5                                                 CK baseline      0.45     0.5                                                  CK 24 hours      1.35     0.5                                                  AST(SGOT) baseline                                                                              155.5    212.5                                                AST(SGOT) 24 hours                                                                              3444.0   942.0                                                ALT(SGPT) baseline                                                                              62.0     80.5                                                 ALT(SGPT) 24 hours                                                                              958.5    192.5                                                ______________________________________                                    

It is noted that the increase for the stabilized rHb is much less pronounced.

Dogs were first stressed by withdrawing approximately 30% of their blood volume, allowing them to stabilize for 30 mins, and then resuscitating with either rHb1.1, saline, human serum albumin or autologous blood.

Histopathology

No macroscopic or microscopic lesions were observed in organs (i.e., kidneys) at the end of the study that could be directly attributed to the administration of recombinant, stabilized hemoglobin.

Serum Chemistry

Serum chemistry data for dogs treated with rHb1.1 were evaluated for study day 2 (22-24 hours after hypovolemic stress) for CK (creatine Kinase), ALT (Alanine Aminotransferase), AST (Aspartate Aminotransferase) and Urea (Blood Urea Nitrogen, BUN). The AST and CK were notably increased in the rHb1.1 treated animals at day 2, however, these values returned to pretreatment levels by study day 4.

Urine Biochemistry

Urinalysis was not affected by the administration of rHb1.1. Other than a very slight decrease in NAG activity in all females (in all treatments) on study day 2, no notable findings were observed.

In summary, the magnitude and time course of cardiovascular and hemodynamic effects observed in severely hemorrhagic dogs resuscitated with recombinant di-alpha hemoglobin were nearly identical to those observed in dogs resuscitated with autologous blood. This study demonstrates that recombinant hemoglobin does not cause adverse cardiovascular, or hemodynamic effects when compared to HSA or autologous blood resuscitation in severely stressed dogs. Moreover, 30% blood volume resuscitation with recombinant di-alpha hemoglobin (1.1 g/kg) in hypovolemic dogs produced only limited, reversible effects, including transient neutropenia and elevation of blood plasma enzymes of skeletal muscle origin. No major organ systems were affected, no hemoglobin was found in the urine of recombinant hemoglobin treated dogs, and no histopathological changes were observed.

EXAMPLE 28 Construction of Di-α Globin Mono-Cysteine (A71C, D75C, or S81C) Mutant Expression Vector

28.1 Subcloning of the α Gene into Phagescript

The desfxα pGem (pDLII-83a) vector was cut with EcoR1 and Pst1 endonucleases and ligated into EcoR1/Pst1 digested phagescript (obtained from Stratagene). E. coli strain DH5α was transformed with the ligation mixture and cells were plated on 2xYT plates overlaid with 3 ml top agar containing 10 μl 100 mM IPTG, 25 μl 2% X-Gal in DMSO and 150 μl XL-1 cells (Stratagene). Clear plaques were picked and grown at 37° C. in 2xYT containing XL-1 cells. Double stranded DNA was isolated from the cultures and checked for the presence of the 500 bp α gene by restriction analysis and agarose gel electrophoresis. Single stranded DNA was isolated from one of the desfxα phagescript transformants (named f 191). The single stranded DNA was sequenced to confirm the presence of the desfxα gene in the phagescript.

28.2 Mutagenic Oligonucleotides

Three mutagenic oligonucleotides were used in three separate mutagenic reactions. The sequences of the oligonucleotides were as follows (mutant codon is underlined):

Nigeria mutation

αS81C

5'CCG AAC GCG TTG TGC GCT CTG TCT GAT 3' (SEQ ID NO: 5)

αD75C

5'GGT GCT CAC GTT GAT TGC ATG CCG AAC GCG 3' (SEQ ID NO: 6)

αA71C

5'CTG ACC AAC GCT GTT TGC CAC GTT GAT GAT 3' (SEQ ID NO: 7)

28.3 Kinase Reaction Conditions for Mutagenic Oligonucleotides A71C, D75C and S81C

1 μl oligonucleotide (approx. 300 pmol)

2 μl 10× kinase buffer containing 10 mM ATP 0.5 μl T4 polynucleotide kinase (10 U/μ, New England Biolabs)

15.5 μl H₂ O

1 μl 10 mM spermidine

Reactions were incubated for 1 hr. at 37° C., then 80 μl H₂ O was added and the reaction was terminated by heat inactivation at 65° C. for 10 min.

28.4 Mutagenesis Reaction

1 μl f 191 ss DNA (0.5 pmol)

3 μl kinased oligonucleotide (either A71C, D75C or S81C approx. 45 pmol)

2 μl 10 × annealing buffer (Promega)

14 μl H₂ O

The no primer control contained:

1 μl f 191 ss DNA

2 μl 10× annealing buffer

17 μl H₂ O

Reactions were heated to 65° C., cooled slowly to 35° C. (approx. 70 min), and put on ice for 5 min. The following reagents were added and the reactions were incubated at 37° C. for 90 min.

3 μl 10× synthesis buffer (Promega)

1 μl T4 gene 32 protein (0.5 μg/μl, Biorad)

1 μl T4 DNA polymerase (3 U/μl, NEB)

0.5 μl T4 DNA ligase (10 U/μl, NEB)

5 μl H₂ O

200 μl 71-18 mut S competent cells (made according to Promega Altered Sites procedure) were transformed with 10 μl of each mutagenesis reaction, put on ice for 30 min and heat shocked for 2 min at 42° C. The transformation mixture was added to 3 ml 2xYT mediα and grown at 37° C. (with shaking) for 5.5 hr. After incubation, 1 ml of each of the three cultures was removed, centrifuged and 800 μl was stored in a fresh tube at 4° C. as the stock solution of mutant phage.

28.5 Screening for Mutants of D75C

100 μl of a 10⁵ dilution of the D75C mutant phage stock was plated on 153 mm 2xYT plates overlain with top agar containing 0.5 ml XL-1 cells. Plates were incubated at 37° C. for approx. 5 hrs. Duplicate nitrocellulose filters were lifted off each plate and the plaques were lysed in 6 ml 0.5M NaOH/1.5M NaCl, neutralized in 10 ml 1M Tris-HCl pH 8.0/1.5M NaCl and washed in 500 ml 6×SSC. The filters were air dried and baked at 75° C. for 45 min. The filters were then boiled briefly in 1% SDS prior to prehybridization. Filters were prehybridized in 20 ml solution for 4 hr at 68° C. The prehybridization solution was as follows:

5×SSC (20×SSC prepared according to recipe in Maniatis).

0.1% (w/v) N-lauroylsarcosine

0.02% (w/v) SDS

0.5% blocking reagent (Genius Kit, Boehringer Mannheim)

The D75C oligonucleotide was labelled with γ.sup. 32! P ATP as follows:

1 μl oligonucleotide (80 pmol)

10 μl 10× kinase buffer

1 μl γ.sup. 32! P ATP (10 μC/μl. Specific activity>3000 Ci/mmol).

87 μl H₂ O

1 μl kinase (10U/μl, NEB)

The reaction was incubated for 5 hrs. at 37° C. Unincorporated ATP was removed by centrifugation through a Biospin 30 column (Biorad). The entire probe (17,000 cpm/μl) was added to the prehybridization mixture and the filters were hybridized overnight at 46° C. along with a no primer control filter. The following day, filters were washed for 10 min. at room temperature (RT) in 6×SSC and exposed overnight at -70° C. on Kodak X-Omat film. Filters were washed in 6×SSC at 57° C. for 10 min, dried and exposed overnight, then washed in 6×SSC/0.1% SDS at 67° C. for 10 min, and dried and re-exposed overnight. The final was in 6×SSC/0.1% SDS at 70° C. for 10 min and the filters were again dried and exposed overnight.

Ten plaques were picked which hybridized differentially to the mutant oligonucleotide (compared to the no primer control plaques). The plaques were placed in 5 ml 2xYT mediα containing 0.25 ml XL-1 cells. The cultures were incubated with shaking at 37° C. for 7.5 hr. 1 ml of each culture was removed, centrifuged 5 min, placed in a fresh tube and stored at 4° C. for subsequent sequencing and plaque purification.

28.6 Screening for Mutants of A71C and S81C

1 μl of a 10⁻³ dilution of the A71C stock phage mutagenesis reaction and 20 μl of a 10⁻⁵ dilution of the S81C mutagenesis reaction were plated on four separate 82 mm 2xYT/tet(10 mg/ml) plates overlaid with 3 ml top agar and 100 μl XL-1 cells. A no primer control was also plated as above. The plates were incubated at 37° C. for 5 hr; plaques were lifted from each plate onto nitrocellulose filters and the filters dried overnight at room temperature. The following day, the plaques were lysed with 0.5M NaOH/1.5M NaCl for 3 min, neutralized in 1M Tris-HCl pH 7.0/1.5M NaCl for 3 min and washed in 6×SSC for 5 min. Filters were air dried then baked at 75° C. for 1 hr. The filters were boiled briefly in 1.5% SDS prior to prehybridization at 60° C. for 6 hr. in 10 ml prehybridization solution as described above.

28.7 Labelling of A71C and S81C Oligonucleotides using Digoxigenin

(All reagents supplied by Boehringer Mannheim)

2 μl A71C (100 pmol) or 1 μl S81C (110 pmol)

10 μl terminal transferase buffer

5 μl 25 mM CoCl₂

1 μl 1 mM dUTP-digoxigenin

30 μl 31 μl H₂ O (A71C and S81C reactions respectively)

1 μl terminal transferase (25 U/μl)

Reactions were incubated at 37° C. for 3 hr. followed by 6 hr. at RT. Digoxigenin-labelled A71C and S81C probes (20 μl) were added to the appropriate filters in 10 ml prehybridization solution along with a no primer control filter. The filters were hybridized overnight at 47° C.

28.8 Filter Washes and Development

All filters were initially washed in 6×SSC/0.1% SDS for 15 min at 30° C., then for 15 min at 42° C.

Each of the four filters which had been probed with either the labelled A71C or S81C oligonucleotides were then separated and washed at increasingly higher temperatures along with a no primer control filter. One each of the A71C and S81C filters were placed in plastic bags containing 10 ml of 6×SSC/0.1% SDS and washed for 10 min at one of three temperatures, i.e., 50° C., 60° C. or 65° C. After the high temperature washes, each set of filters were developed according to the Genius Kit protocol.

Initially, bags containing the filters were filled with 10 ml of 100 mM Tris-HCl, pH 7.5/150 mM NaCl (buffer A) and incubated for 15 min. The buffer was removed and replaced with 10 ml buffer A containing 0.5% blocking reagent and incubated a further 15 min at RT without shaking. Anti-digoxigenin antibody (2 μl) was added directly to each bag and incubated with for 30 min at RT. The filters were then removed from their respective bags and washed altogether in 100 ml buffer A/0.05% blocking reagent for 15 min at RT, followed by a 15 min wash in buffer A alone at RT. The final wash was 100 ml 100 mM Tris-HCl, pH 9.5/100 mM NaCl/50 mM MgCl₂ (buffer B) for 5 min at RT. Each set of filters from a given temperature was placed in a separate bag along with 5 ml of color development solution (5 ml buffer B containing 22.5 μl 75 mg/ml NBT/15 μl 50 mg/ml X-phosphate). The filters were incubated for 30 min in the dark at RT. After 30 min, the filters were removed from the development solution, washed for 5 min in 100 ml 10×TE and 5 min in 100 ml 1× TE, both at RT. Filters were dried at RT.

Using the results from the Genius Kit screening procedure, 10 plaques which differentially hybridized to the labelled oligonucleotides A71C or S81C were picked and placed in 3 ml 2xYT mediα containing 0.25 ml XL-1 cells and incubated for 7.5 hr. at 37° C. with shaking. 1 ml of each culture was removed, centrifuged 5 min, placed in a fresh tube and stored at 4° C. for subsequent sequencing and plaque purification.

28.9 Confirmation of Mutations by Sequencing

Single stranded DNA was isolated from 800 μl mutant phage stock supernatant and sequenced using the Sequenase kit (USB) with the α 179 oligonucleotide as the primer. The a 179 aligonucleotide is an 18-mer homolog cys to a region about 100 bps upstream of the mutation site.) Sequencing confirmed the presence of the αA71C, αD75C and α S81C mutations.

Phage stock was plaque purified by plating 10 μl of 10⁻⁸ and 10⁻¹⁰ dilutions on 2xYT/tet (10 mg/ml) plates overlaid with 3 ml top agar containing 200 μl XL-1 cells. After 7 hrs incubation at 37° C., a single isolated plaque from each mutant plate was picked and used to inoculate 90 ml 2xYT/tet (10 mg/ml) mediα containing 10 ml XL-1 cells. Cultures were grown overnight at 37° C. with shaking. 1 ml of each mutant phage culture was removed, centrifuged and the supernatant was frozen at -80° C. as the respective purified mutant phage stock. Double stranded DNA was prepared from the remaining culture for use in the subsequent subcloning steps into the final expression vector1.1E4.

28.10 Subcloning of the α cys Mutants into 1.1E4

Construction of the di-α gene with each of the three cysteine mutations in either the N-terminal or C-terminal domain of the di-α protein required three subcloning steps:

1) Transfer of the cys mutant a gene from phagescript as an Eag1-Pst1 fragment into the Eag1-Pst1 digested desval a pGem vector (pDL II-91f, see #10, table 200). This step provided the mutant a gene with the correct 5' terminus.

2) A mutant di-α gene with each of the cys mutations in the 3' a gene was constructed by inserting the Eag1 DNA fragment from di-α pGem (see pGem di-alpha, #5a, table 200) into the Eag1 site of the relevant cys mutant desval a pGem plasmid. The mutant diα gene with the cys mutation in the 5' α gene was constructed by inserting the BstB1 DNA fragment from diα pGem into the BstB1 site of the cys mutant desval a pGem plasmid.

3) Finally each of the mutant diα genes were cloned into the 1.1E4 expression vector as a Sma1-Pst1 fragment.

Transformations into DH5α at each step in the subcloning procedure were carried out as described in the methods of subcloning of the β G83C mutation into 1.1E4 (see below). The presence of the relevant cys mutation in the correct α gene was confirmed by sequencing at each stage in the subcloning procedure. Each of the diα cys mutants in 1.1E4 were transformed into E. coli strain 127, grown in TB complete media and induced with IPTG. Expression of the diα and β proteins was confirmed by SDS-PAGE and Western blot analysis.

EXAMPLE 29 Hypothetical Protocol for the Oxidation of Two SGE1.1 Mono-cys's to Form a Pseudo-Octamer

The surface cysteine mutants (MW=64 kDa) can be oxidized to the disulfide-linked dimer under oxidative conditions. This can be accomplished by stirring a concentrated solution of the expressed protein at pH 8 under pure oxygen at 4° C. or room temperature in the dark. Trace levels of transition metal ions such as CU+2 may be added to level below 1 uM to catalyze the oxidation (1). Formation of the 128 kDa octamer can be monitored by gel filtration. Saturation of the solution with oxygen at elevated pH should minimize autooxidation of recombinant hemoglobin.

An alternative procedure, which may be the preferred method of catalyzing this reaction, involves the use of redox buffers such as reduced and oxidized glutathione, or reduced and oxidized dithiothreitol (2). This catalysis of the reaction through disulfide interchange may be necessary to control trace transition metal catalysis (3). An second, similar approach involves conversion of the surface cysteines in the 65 kDa species to sulfonates before purification (to avoid 128 kDa species formation during purification), followed by conversion to the disulfide-linked 128 kDa species with reduced glutathione (2).

(1) Freedman, R. B. and Hillson, D. A. (1980) "Formation of Disulfide Bonds" IN: The Enzymology of Post Translational Modification of Proteins, Vol. 1, p. 157 ff. (Academic Press).

(2) DiMarchi, R., et al. (1988) Chemical synthesis of human epidermal growth factor (EGF) and human type a transforming growth factor (TGFa) IN: Peptides: Chemistry and Biology (G. R. Marshall, ed.) pp. 202-203 (Leiden:ESCOM).

(3) Creighton, TE (1978) Experimental studies of protein folding and unfolding. Prog. Biophys. Molec. Biol. 33:231-297

EXAMPLE 30 Characterization of Di-Alpha Hemoglobin

We constructed an E. coli expression vector that contains two copies of the α globin gene fused in tandem by a single codon encoding a glycine residue; this created a fusion junction with the sequence Arg(141α1)-Gly-Val(1α2). In this construct a single operon encoding a diα-globin and a β-globin chain is transcribed from a single TAC promoter. Hoffman, S. J., Looker, D. L., Roehrich, J. M., Cozart, P., Durfee, S., Stetler, G., Proc. Nat. Acad. Sci. (USA) 87, 8521-25 (1990). To decrease the oxygen affinity of this engineered Hb we have introduced an additional Asn-108β→Lys mutation. In its naturally occurring form (Hb Presbyterian) this substitution reduced the oxygen affinity and slightly increased the Bohr effect. Moo-Penn, W., Wolff, J., Simon, G., Vacek, J., Jue, H., Johnson, M., FEBS Lett. 92, 56-57 (19978). Upon induction of the Hb operon with IPTG, E. coli cells carrying a plasmid containing the fused α globin gene produce two polypeptides corresponding to β globin and fused diα globin.

To fully assess the separate effects of the α-globin fusion and the Asn-1908β→Lys mutation, we also constructed vectors to express four related proteins: (1) A mutant form of the tetramer that contains Val to Met changes at the amino termini of both α and β chains and the Asn-108β→Lys mutation (rHb1.0); (2) pseudotetrameric diα-haemoglobin with Asn-108β→Lys and amino-termini modified as described above (rHb1.1); (3) a pseudotetramer without the Asn-108β→Lys (rHb0.1); and (4) tetramer with neither the α fusion nor the β globin mutation (rHb0.0). The four forms of recombinant hemoglobin described above were purified from E. coli and analyzed for functionality. The table below summarizes these results and compares them to adult human native haemoglobin (Hb A) purified blood. For analytical methods, see Hoffman, S. J., Looker, D. L., Roehrich, J. M., Cozart, P., Durfee, S., Stetler, G., Proc. Nat. Acad. Sci. (USA) 87, 8521-25 (1990). Wagenbach, M., O'Rourke, K., Vitez, L., Wieczorek, A., Hoffman, S., Durfee, S., Tedesco., J. Stetler, G. Biotechnology, 9, 57-61 (1991). The Bohr coefficient is the slope of the line generated by plotting log P₅₀ as a function of increasing pH (pH 6.8-7.8). Oxygen equilibrium binding data were collected at 20° C. in 50 mM HEPES buffer containing 100 mM NaCl. Also shown are the amino acids present at the amino termini of the α and β-chains, at position 108 of the β-chain and the structure of the α-chain.

The Asn-108β→Lys mutation causes a substantial decrease in oxygen affinity of both the pseudotetramer and the tetramer when compared to wild type human haemoglobin or rHb0.0. The Asn-108β→Lys mutation apparently also causes an increase in the Bohr effect which is consistent with the initial characterization of human Hb Presbyterian. Fusion of the two α globin subunits causes a substantial increase in oxygen affinity and a slight decrease in cooperativity. Finally, the valine to methionine alteration at the amino termini of the α and β subunits results in a decrease in the oxygen affinity that is apparently completely compensated for by the fusion of the two α-globin subunits.

    ______________________________________                                                   WT    1.1     1.0     0.1   0.0                                      ______________________________________                                         Bohr Coefficient                                                                           -0.47   -0.32   -0.35 -0.25 -0.27                                  p50 (torr) (20° C.)                                                                 4.5     17.2    19.8  4.5   7.3                                    Hill Coeff (n max)                                                                         2.90    2.35    2.53  2.04  2.59                                   α-chain                                                                              WT      diα                                                                              WT    diα                                                                            WT                                     α-chain (NH3)                                                                        VL      ML      ML    ML    ML                                     β-chain (β108)                                                                   N       K       K     N     N                                      β-chain (NH3)                                                                         VH      MH      MH    MH    MH                                     ______________________________________                                    

In order to study the effect of these modifications on the structure of Hb we crystallized rHb1.1 in the deoxy form and solved the structure by X-ray crystallography. FIG. 32 shows a symmetry averaged difference Fourier map of rHb1.1 minus deoxy Hb A in the C-terminal region of one a subunit. The peptide linkage between the two α subunits breaks the molecular dyad symmetry of the Hb tetramer. In deoxyhemoglobin crystals with space group P21, the molecular two-fold axis is not a crystallographic two-fold axis, so that the electron density maps outline the structure of the symmetry-related αβ-dimers separately. In FIG. 32 these are superimposed. The filled bonds and single broken bonds represent approximate coordinates of the two symmetry-related configurations of the rHb1.1 α-chain termini: α1-Gly-α2 (open bonds) and α2-Gly-α1 (broken bonds). α1 and α2 refer to two a subunits related by a non-crystallographic dyad axis in each asymmetric unit of p21 deoxyhemoglobin crystals, but not the order in the tandemly duplicated polypeptide. The similarity of the unaveraged difference map in the two regions near Na1α1/HC3α2 and NA1α2/HC3α1 indicates that the two configurations are present at nearly equal occupancies. Positive densities bridging the N- and C-termini indicate that the glycine residue linking the C-terminus of one α chain to the N-terminus of the symmetry related α subunits. The negative and positive densities around a few residues near both N- and C-termini indicate that fusion of the two α subunits slightly constrains the conformation of both termini. The Asn-108β→Lys 5 mg/ml mutation causes substantial structural changes including a shift of the B and G helices shown by paired positive and negative densities along these two helices (not illustrated) but the structural basis of the reduced affinity is not clear.

To test whether or not the fusion of the a subunits extends the intravascular half-life of hemoglobin, we prepared radioactively labelled rHb1.1 and rHb1.0 for in vivo studies in rats. As shown in FIG. 33, rHb1.1 containing fused a subunits shows a significant increase in intravascular half-life compared to its non-fused homologue. More comprehensive toxicology and pharmacokinetic studies in rats and dogs indicate that renal toxicity associated with tetrameric hemoglobin has been eliminated by the fusion of the two α-chains of rHb1.1. These results are consistent with those reported for chemically crosslinked hemoglobins. Bucci, E., Razynska, A., Urbaitis, B., Fronticelli, C., J. Biol. Chem. 264, 6191-95 (1989); Manning, L. R., et al., Proc. Natl. Acad. Sci (USA) 88, 3329-33 (1991). However, in contrast to chemically cross-linked hemoglobins, large amounts of genetically fused diα-haemoglobin can be produced by simple microbial fermentations and purified without the additional steps or chemicals required to produce chemically cross-linked haemoglobins. Our novel engineered Hb therefore provides a most promising way of producing a safe blood substitute.

EXAMPLE 31 Hypothetical Protocol for the Construction of Hemoglobin Molecules Stabilized Against Dimer Formation by Fusion Across the Alpha 1- Beta 2 or Alpha 2- Beta 1 Dimer Interface Region

The currently employed inter-dimer di-alpha fusion between the C terminus of one alpha subunit and the N terminus of the adjacent alpha subunit, represents a successful protein engineering approach to stabilizing hemoglobin against dimer formation. In this case, use was made of the fortunate juxtaposition of the two termini which originate from different dimers. One might also make a di-beta polypeptide, as has been described, or a hemoglobin with both di-alpha and di-beta polypeptide, as has been described, or a hemoglobin with both di-alpha and di-beta linked subunits. Alternatively, one can envision other types of fusion in which the alpha subunit of one alpha/beta dimer is fused to the beta subunit of the other dimer (FIG. 1). In this, two individual, linked polypeptides would dimerize to form the psuedo-tetrameric hemoglobin. This approach is based on the fact that dimerization involves specific, identical pairs of subunits, generally referred to as α1β1 and α2β2.

As an example of this alternative fusion approach, the alpha subunit C terminal residue (Arg 141) of dimer 1 could be fused, either directly or with an intervening fusion sequence, to the N-terminal amino acid of the beta subunit C helix (Tyr 35) of dimer 2. This would create a new C terminal residue at the end of the beta B helix (Val 34) and would leave a "free" piece of polypeptide comprised of the beta A and B helices (residues 1 to 34 inclusive). These alterations would give rise to a protein comprised of alpha subunit helices A through H fused to beta subunit helices C through H. The polypeptide composed of the beta subunit A and B helices would be covalently attached to the protein by introducing a new helix into the molecule. The helix would be designed to span the distance between the beta C terminus (His 146) and the original beta N terminus of helix A (Val 1). Following these changes, the sequence of helices from the N to C terminus of the new protein would be (alpha) A-B-C-E-F'-F-G-H-(beta)-C-D-E-F'-F-G-H-NEW-A-B. The actual arrangement or the fusion regions would require careful design so that new regions of structure did not extend into the dimer-dimer interface region which is critical to cooperativity. Introduction of amino acids containing basic or acidic residues into the molecule at certain positions could allow some restoration of functionally important salt bridges and hydrogen bonds which could be lost as a result of manipulating the normal N and C termini. The above approach could also extend to the production of the entire hemoglobin molecule or individual dimers as single polypeptide chains, although in the latter case this would not be expected to offer stabilization against dimer formation.

For the purpose of providing the potential for disulfide bond formation, a cysteine may be introduced into either the α or β globin domain of the α1β2 pseudodimer.

EXAMPLE 32 Production of Recombinant Hemoglobin in E. Coli

32.1 FERMENTATION

The two liter fermentation procedures described below can be used to obtain material for purification and functionality determinations.

To prepare the fermenter inoculum, 400 μl of seed stock are inoculated into 200 mL of 4.1 g/L KH₂ PO₄, 7.0 g/L KHPO₄, 2.0 g/L (NH₄)₂ SO₄, 1.0 g/L Na₃ Citrate.2H₂ O, 154 mg/L MgSO₄.7H₂ O, up to 230 mg of proline, 0.2% yeast extract, 1% glucose, 300 μl of 20 mg/ml thiamine in sterile-filtered solution, 133 μl of 15 mg/ml tetracycline, and 0.6 ml of a trace metal solution. The trace metal solution contains 25 μg/ml FeCl₃ +6H₂ O, 1.3 μg/ml ZnCl₂, 2.0 μg/ml COCl₂ +6H₂ O, 2 μg/ml Na₂ MoO₄ +2H₂ O, 1.0 μg/ml CaCl₂ +2H₂ O, 2.54 μg/ml Cu(II)S0₄ +5H₂ O, 0.5 μg/ml H₃ BO₃, 1.2 μg/ml MnCl₂ +4H₂ O, and 100 μl/ml HCl dissolved in a 0.5M Na-citrate solution. This culture is allowed to grow at 37° C. on a shaker until an O.D._(600nm) of 0.4-0.6 is achieved. The entire inoculum is then asceptically transferred to a 2-liter fermentor containing 2 g/L KH₂ PO₄, 3.6 g/L KHPO₄, 2.0 g/L (NH₄ )₂ SO₄, 1 mL/L polypropylene glycol-2000, 50 mL/L of 50% glucose, 100 mg/L of thiamine, 9.75 mg/L of tetracycline, 4 mL/L of trace metals, 1.54 g/L MgSO₄.7H₂ O and 3.68 g/L Na₃ Citrate.2H₂ O. The pH is maintained at 6.8 by addition of 15% to 30% NH₄ OH, dissolved oxygen is maintained at or above 30%, and 60% glucose is added throughout the growth period, sufficient to maintain low but adequate levels of glucose in the culture (0.5 g/L-10 g/L). The culture is grown between 25 and 30° C. to an OD₆₀₀ ¹³ 10-40 prior to induction with 10-1000 μM IPTG. Upon induction of hemoglobin synthesis, the E. coli heme biosynthesis is supplemented by addition of hemin, either by addition of the total mass of hemin required at induction, or by periodic addition of hemin dissolved in 50 mM to 1M NaOH (e.g. one third of the total mass of hemin to be added to the fermentor is added at induction, another third is added after 1/4 of the total time after fermentation has elapsed, and the last third is added half-way through the induction period). Total hemin added can range from 50 to 300 mg/L. The fermenter is allowed to continue for 8-12 hours post-induction. At the end of this period, several 1 ml aliquots are removed from the broth for determination of hemoglobin production.

32.2 CULTURE HARVEST BREAKAGE AND LYSATE PREPARATION

Cells can be harvested by centrifugation at 10,000 ×g for 10 minutes or they can be collected by filtration by cross-flow filtration with 0.2 μm membranes (e.g., Millipore Prostak). The cells are washed or resuspended to 30% (w/v) in a 25 mM Na-borate/2 mM EDTA buffer (pH 9.3). Lysozyme (final conc. 0.02 g/L) and protease inhibitor (e.g., 1 mM benzamidine or 50,000 I.U/L aprotonin) are added to the preparation. The suspended cells are allowed to incubate for 30 minutes at 30-31° C., and then are broken by one or more passes through a homogenizer such as a Gaulin Model 30-CD™ Homogenizer operated between 10 and 14 Kpsi or a Microfluidics Corp. Cell Disruptor Microfluidizer™ set at 13 Kpsi. The remainder of the process may be performed either under oxygen or nitrogen. The temperature of the lysate may be adjusted to 40° C. or the solution may be utilized directly.

The lysed cells are then titrated to above pH-6.8, preferably about 8.3, with 5N NaOH. Conductivity is adjusted to 30 Kmhos by addition of NaCl. The broken cell extract is clarified and the cell debris washed with borate buffer containing protease inhibitor (as above) by ultrafiltration. Since the hemoglobin product is soluble, it passes through the filtration membranes.

After clarification all subsequent steps are performed in the cold (<10° C.). The solution may be purified by passage through either a strong cation exchange column (C), followed by a second strong cation exchange column (C) followed by a strong anion exchange column (A), or by passage through C, then A then C, or alternatively, A, C, then A, or finally simply by passage through only one strong cation exchange column followed by A. Below we describe one possible purification scheme. The CCA column purification order is described here.

The solution is oxygenated, then enough 10 mM sodium phosphate buffer is added to the preparation to bring the conductivity down to ≧1800 mmhos. The resulting solution is then titrated to pH 6.7-7.0 with 0.5-5N H₃ PO₄, loaded onto a strong cation exchange column (such as BioRad Macro-Prep 50 S column, IPF Biotechnics S-CPI or S-Spherodex columns, PerSeptive Biosystems Poros™ S or HS/MII columns, a Pharmacia S-Sepharose Fast Flow column, a TosoHaas Toyopearl™ SP-550C column, or Whatman SE52 or SE53 columns) pre-equilibrated with 10 mM sodium phosphate, washed, and then eluted with 14-20 mM sodium phosphate buffer at pH 7.4-7.7. A fraction collector can be used to collect fractions of interest. Alternatively, the solution may be further processed as described below.

A second cation exchange step can be performed. The solution from the first cation exchange step is diluted with an equal volume of deionized water, and the pH is adjusted to 6.8 with 5N phosphoric acid to provide adequate binding conditions for the second cation exchange column. A strong cation exchanger such as those listed above is used. The column is pre-equilibrated with 10 mM sodium phosphate, pH 6.8. After loading, the bound protein on the column is washed with approximately 2 column volumes of the equilibration buffer. A 14 mM sodium phosphate, pH 7.4 wash, is then used to selectively elute the protein. During elution, the column effluent is monitored for total protein by UV absorbance at 280 nm.

Additional purification of recombinant hemoglobin can be achieved by means of an anion exchange column such as BioRad Macro-Prep 50 Q column, PerSeptive BioSystems Poros™ Q column, Pharmacia Q-Sepharose Fast Flow column, TosoHaas Toyopearl™ QAE-550C column, or Whatman BioSystems QA52 or DE53 column. Prior to use, the column is equilibrated with 20 mM tris-HCl buffer, pH 8.3. Material from the second cation exchange elution is diluted or diafiltered with 20 mM tris-HCl buffer, pH 8.3 to allow binding to the chromatography resin. After loading, the column is washed with 2 column volumes of equilibration buffer, and the protein eluted with 30 mM tris-HCl buffer, pH 7.2.

CONCENTRATION/BUFFER EXCHANGE

Protein from the final chromatography step can be concentrated and exchanged into 5 mM sodium phosphate/150 mM sodium chloride, pH 7.3 with a 30,000 molecular weight cutoff ultrafiltration membrane.

Reference Example A: Reconstitution of Recombinant Alpha-Globin and Recombinant Beta-Globin with Heme and Chemical Reduction to Yield Artificial Hemoglobin

Conventional methods of preparing artificial hemoglobin are exemplified by the following procedure.

The lyophilized recombinant alpha and beta-globins (100 mg each) were individually dissolved in 8M urea/50 mM Tris-Cl, pH 8.01/1 mM EDTA/1 mM DTT, diluted to a concentration of 5 mg/ml and incubated at room temperature for 3-4 hours. The alpha-globin was then diluted to 0.3mg/ml with chilled 20 mM K₂ HPO₄, pH 5.7/1 mM EDTA/1 mM DTT. Hemin (25 mg) was dissolved in 2.4mg 0.1M KOH, diluted with an equal volume of 1M KCN; this solution was then made 0.1 mg/ml in hemin and 20 mM K₂ HPO₄, pH 6.7 with stock phosphate buffer. Hemin from this solution was added at a 2.8 molar excess to the chilled alpha-globin; and equal molar amount of beta-globin was added and the solution was dialyzed at 4° C. overnight against 0.1M K₂ HPO₄, pH 7.6/1 mM EDTA/1 mM KCN. The artificial Hb solution was concentrated by ultra-filtration using a PM-10 membrane (Amicon) and transferred into a 200 ml screw-top test tube with a rubber septum. The hemoglobin solution was deoxygenated by evacuation and flushing with N₂, and then the solution was saturated with CO. 100mM sodium dithionite solution was prepared anaerobically in a 20 ml screw-top test tube with rubber septum. 4.5 equivalents of dithionite were added to the Hb solution with a syringe, and the mixture incubated on ice for 15 min. The Hb solution was gel-filtered against 10 mM Na phosphate buffer pH 6.0 on a 4×40 cm Sephadex G-25 (fine) column. The colored solution was then applied to a 2×10 cm-52 (Whatman) column equilibrated with the same buffer and the chromatography was developed with a linear gradient of 500 ml Na phosphate buffer pH 6.0 and 500 ml of 70 mM sodium phosphate buffer pH 6.9. CO was removed from Hb by photolysis under a stream of oxygen. Artificial Hgb prepared this way is isolated in only about 25% yield from the fusion peptides but shows native oxygen binding properties.

Reference Example B: P₅₀ Determination

Our preferred method of measuring P50 of purified hemoglobin solutions for the purpose of the appended claims is as follows:

Hemoglobin-oxygen equilibrium curves are measured using a Hemox Analyzer™ (TCS Medical Products, Southampton, Pa.) at either 25° C. or 37° C.+0.1° C. in 50 mM HEPES buffer/0.1M NaCl, pH 7.4. Oxygen equilibrium curves are measured by N2 deoxygenation of an oxyhemoglobin solution that has been previously equilibrated with water-saturated O₂ (for samples with a P50>10 torr) or with water-saturated compressed air. Absorbance readings at 568 and 558 nm are measured throughout the run for determination of percent oxyhemoglobin in the sample. Percent oxyhemoglobin is directly proportional to log Aλ558/log Aλ568 and is independent of path length. Both the absorbances and the oxygen pressure are sampled by a programmable-gain 12-bit analog-to-digital converter (Labmaster PGH, Scientific Solutions, Solon, Ohio) under computer control. The oxygen equilibrium curve is subjected to a low-pass digital filter. P₅₀ values (partial pressure of O₂ required for 50% saturation of oxygen binding sites) and Hill coefficients (^(n) max) are calculated from the digitally filtered data by using software developed in our laboratory. The Hill coefficients are determined as the maximum slope of the functions dlog y/(1-y)!/dlog p, where y is % O₂ saturation and p is partial pressure of O₂.

P₅₀ may also be measured under other conditions, but it should be noted that many environmental factors affect hemoglobin's oxygen affinity. The effect of PH, CO₂ inorganic anions, organic phosphates and temperature on P₅₀ are discussed in Bunn and Forget, HEMOGLOBIN: MOLECULAR, GENETIC AND CLINICAL ASPECTS 37-47, 95-98 (W. B. Saunders Co., 1986).

Since many determinations of whole blood oxygen binding curves are made under standard physiologic conditions (37° C., pH, 7.4, PCO₂ 40 mm Hg), it may be necessary to adjust literature figures. In this context, it should be noted that a 10° C. increase results in nearly a two-fold increase in P₅₀, while the dependence of P₅₀ on pH is approximately given as delta log P₅₀ /delta pH=-0.5.

Comparing P₅₀ values of purified Hb preparations to P₅₀ values of whole blood can be problematic. Whole blood, or isolated RBC's contain many components that naturally modulate the shape of the hemoglobin-oxygen binding curve. The RBC encapsulates Hgb in the presence of a high concentration of the effector molecule 2.3-DPG; a molecule that causes Hgb to have a markedly lower affinity for O₂. Other intra-erythrocyte components also affect the shape of the binding curve: ATP, Cl-CO₂, H+, orthophosphate, methemoglobin and carboxyhemoglobin. The levels of these substances may vary with age, sex and condition. These substances are not normally present in purified HgB solutions and thus, the P₅₀ value of purified Hgb is lower than that found in whole blood. One very important modulator of Hgb-oxygen affinity is C1- ion. C1 ion is found outside the erythrocyte in the blood serum at a physiologic concentration of approximately 0.15M. Since C1- causes a lower O₂ affinity, a Hgb solution with a P50 measured in vitro may well have much lower O₂ affinity if infused into the blood stream. Another problem with measuring O₂ binding of whole blood is that RBCs are quite fragile and in the process of manipulating the erythrocyte into the instrument used to measure the O₂ binding it is inevitable that at least a small percentage of the RBCs will lyse. Lysed RBCs leak Hgb into the surrounding media away from 2,3-DPG; hence, since free Hgb has a higher affinity than intraerythrocyte Hgb, lysed RBCs will have a higher O₂ affinity and can cause a falsely low P₅₀ value for whole blood P₅₀ determinations. It is widely accepted that under physiologic conditions while blood has a P₅₀ value of 26-28 mm Hg. When Hgb is isolated from whole blood, however, the measured P₅₀ is on the order of 1-10 mm Hg depending on the investigator's experimental conditions. For these reasons it is most accurate to measure Hgb-oxygen equilibria with purified Hgb molecules under strict conditions of buffer, pH and salt concentrations. Unfortunately, there are no accepted "standards" for all investigators to measure Hgb oxygen binding for in vitro systems.

Still, as many mutant hemoglobins are first identified in patient's whole blood, one would like to be able to compare the relative affinities of native and mutant Hgb for O2, between whole blood and purified Hgb preparations. An example of this is Hgb Chico (beta lys⁶⁶ -thr). If one examined only the P₅₀ value of the purified mutant Hgb (10.1 mmHg) one would note that Hgb has a P₅₀ value less than that for normal whole blood (27.2 mmHg). Still, when that hemoglobin is measured in RBCs under physiologic conditions it is apparent that it does have a higher P₅₀ than normal whole blood (38 mmHg). One cannot predict the degree that the P50 value will change going from whole blood Chico to a purified Hgb Chico if it were infused into the bloodstream as a blood substitute. One can conclude however, that the P₅₀ will be higher than it is in pure form, and that by reacting the mutant Hgb with organic phosphates that P₅₀ will be even higher.

                  TABLE 1                                                          ______________________________________                                         PRIMARY STRUCTURE OF HUMAN GLOBIN SUBUNITS                                     Helix                                                                                α                                                                               Zeta   Helix*                                                                                β                                                                              δ                                                                              Gamma  ε                         ______________________________________                                         NA1   1 Val  Ser    NA1    1 Val                                                                               Val   Gly    Val                                                   NA2    2 His                                                                               His   His    His                               NA2   2 Leu  Leu    NA3    3 Leu                                                                               Leu   Phe    Phe                               A1    3 Ser  Thr    A1     4 Thr                                                                               Thr   Thr    Thr                               A2    4 Pro  Lys    A2     5 Pro                                                                               Pro   Glu    Ala                               A3    5 Ala  Thr    A3     6 Glu                                                                               Glu   Glu    Glu                               A4    6 Asp  Glu    A4     7 Glu                                                                               Glu   Asp    Glu                               A5    7 Lys  Arg    A5     8 Lys                                                                               Lys   Lys    Lys                               A6    8 Thr  Thr    A6     9 Ser                                                                               Thr   Ala    Ala                               A7    9 Asn  Ile    A7     10 Ala                                                                              Ala   Thr    Ala                               A8    10 Val Ile    A8     11 Val                                                                              Val   Ile    Val                               A9    11 Lys Val    A9     12 Thr                                                                              Asn   Thr    Thr                               A10   12 Ala Ser    A10    13 Ala                                                                              Ala   Ser    Ser                               A11   13 Ala Met    A11    14 Leu                                                                              Leu   Leu    Leu                               A12   14 Trp Trp    A12    15 Trp                                                                              Trp   Trp    Trp                               A13   15 Gly Ala    A13    16 Gly                                                                              Gly   Gly    Ser                               A14   16 Lys Lys    A14    17 Lys                                                                              Lys   Lys    Lys                               A15   17 Val Ile    A15    18 Val                                                                              Val   Val    Met                               A16   18 Gly Ser                                                               AB1   19 Ala Thr                                                               B1    20 His Gln    B1     19 Asn                                                                              Asn   Asn    Asn                               B2    21 Ala Ala    B2     20 Val                                                                              Val   Val    Val                               B3    22 Gly Asp    B3     21 Asp                                                                              Asp   Glu    Glu                               B4    23 Glu Thr    B4     22 Glu                                                                              Ala   Asp    Glu                               B5    24 Tyr Ile    B5     23 Val                                                                              Val   Ala    Ala                               B6    25 Gly Gly    B6     24 Gly                                                                              Gly   Gly    Gly                               B7    26 Ala Thr    B7     25 Gly                                                                              G1y   Gly    Gly                               B8    27 Glu Glu    B8     26 Glu                                                                              Glu   Glu    Glu                               B9    28 Ala Thr    B9     27 Ala                                                                              Ala   Thr    Ala                               B10   29 Leu Leu    B10    28 Leu                                                                              Leu   Leu    Leu                               B11   30 Glu Glu    B11    29 Gly                                                                              Gly   Gly    Gly                               B12   31 Arg Arg    B12    30 Arg                                                                              Arg   Arg    Arg                               B13   32 Met Leu    B13    31 Leu                                                                              Leu   Leu    Leu                               B14   33 Phe Phe    B14    32 Leu                                                                              Leu   Leu    Leu                               B15   34 Leu Leu    B15    33 Val                                                                              Val   Val    Val                               B16   35 Ser Ser    B16    34 Val                                                                              Val   Val    Val                               C1    36 Phe His    C1     35 Tyr                                                                              Tyr   Tyr    Tyr                               C2    37 Pro Pro    C2     36 Pro                                                                              Pro   Pro    Pro                               C3    38 Thr Gln    C3     37 Trp                                                                              Trp   Trp    Trp                               C4    39 Thr Thr    C4     38 Thr                                                                              Thr   Thr    Thr                               C5    40 Lys Lys    C5     39 Gln                                                                              Gln   Gln    Gln                               C6    41 Thr Thr    C6     40 Arg                                                                              Arg   Arg    Arg                               C7    42 Tyr Tyr    C7     41 Phe                                                                              Phe   Phe    Phe                               CE1   43 Phe Phe    CD1    42 Phe                                                                              Phe   Phe    Phe                               CE2   44 Pro Pro    CD2    43 Glu                                                                              Glu   Asp    Asp                               CE3   45 His His    CD3    44 Ser                                                                              Ser   Ser    Ser                               CE4   46 Phe Phe    CD4    45 Phe                                                                              Phe   Phe    Phe                                                   CD5    46 Gly                                                                              Gly   Gly    Gly                               CE5   47 Asp Asp    CD6    47 Asp                                                                              Asp   Asn    Asn                               CE6   48 Leu Leu    CD7    48 Leu                                                                              Leu   Leu    Leu                               CE7   49 Ser His    CD8    49 Ser                                                                              Ser   Ser    Ser                               CE8   50 His Pro    D1     50 Thr                                                                              Ser   Ser    Ser                                                   D2     51 Pro                                                                              Pro   Ala    Pro                                                   D3     52 Asp                                                                              Asp   Ser    Ser                                                   D4     53 Ala                                                                              Ala   Ala    Ala                                                   D5     54 Val                                                                              Val   Ile    Ile                                                   D6     55 Met                                                                              Met   Met    Leu                               CE9   51 Gly Gly    D7     56 Gly                                                                              Gly   Gly    Gly                               E1    52 Ser Ser    E1     57 Asn                                                                              Asn   Asn    Asn                               E2    53 Ala Ala    E2     58 Pro                                                                              Pro   Pro    Pro                               E3    54 Gln Gln    E3     59 Lys                                                                              Lys   Lys    Lys                               E4    55 Val Leu    E4     60 Val                                                                              Val   Val    Val                               E5    56 Lys Arg    E5     61 Lys                                                                              Lys   Lys    Lys                               E6    57 Gly Ala    E6     62 Ala                                                                              Ala   Ala    Ala                               E7    58 His His    E7     63 His                                                                              His   His    His                               E8    59 Gly Gly    E8     64 Gly                                                                              Gly   Gly    Gly                               E9    60 Lys Ser    E9     65 Lys                                                                              Lys   Lys    Lys                               E10   61 Lys Lys    E10    66 Lys                                                                              Lys   Lys    Lys                               E11   62 Val Val    E11    67 Val                                                                              Val   Val    Val                               E12   63 Ala Val    E12    68 Leu                                                                              Leu   Leu    Leu                               E13   64 Asp Ala    E13    69 Gly                                                                              Gly   Thr    Thr                               E14   65 Ala Ala    E14    70 Ala                                                                              Ala   Ser    Ser                               E15   66 Leu Val    E15    71 Phe                                                                              Phe   Leu    Phe                               E16   66 Thr Gly    E16    72 Ser                                                                              Ser   Gly    Gly                               E17   68 Asn Asp    E17    73 Asp                                                                              Asp   Asp    Asp                               E18   69 Ala Ala    E18    74 Gly                                                                              Gly   Ala    Ala                               E19   70 Val Val    E19    75 Leu                                                                              Leu   Ile, Thr                                                                              Ile                               E20   71 Ala Lys    E20    76 Ala                                                                              Ala   Lys    Lys                               EF1   72 His Ser    EF1    77 His                                                                              His   His    Asn                               EF2   73 Val Ile    EF2    78 Leu                                                                              Leu   Leu    Met                               EF3   74 Asp Asp    EF3    79 Asp                                                                              Asp   Asp    Asp                               EF4   75 Asp Asp    EF4    80 Asn                                                                              Asn   Asp    Asn                               EF5   76 Met Ile    EF5    81 Leu                                                                              Leu   Leu    Leu                               EF6   77 Pro Gly    EF6    82 Lys                                                                              Lys   Lys    Lys                               EF7   78 Asn Gly    EF7    83 Gly                                                                              Gly   Gly    Pro                               EF8   79 Ala Ala    EF8    84 Thr                                                                              Thr   Thr    Ala                               F1    80 Leu Leu    F1     85 Phe                                                                              Phe   Phe    Phe                               F2    81 Ser Ser    F2     86 Ala                                                                              Ser   Ala    Ala                               F3    82 Ala Lys    F3     87 Thr                                                                              Gln   Gln    Lys                               F4    83 Leu Leu    F4     88 Leu                                                                              Leu   Leu    Leu                               F5    84 Ser Ser    F5     89 Ser                                                                              Ser   Ser    Ser                               F6    85 Asp Glu    F6     90 Glu                                                                              Glu   Glu    Glu                               F7    86 Leu Leu    F7     91 Leu                                                                              Leu   Leu    Leu                               F8    87 His His    F8     92 His                                                                              His   His    His                               F9    88 Ala Ala    F9     93 Cys                                                                              Cys   Cys    Cys                               FG1   89 His Tyr    FG1    94 Asp                                                                              Asp   Asp    Asp                               FG2   90 Lys Ile    FG2    95 Lys                                                                              Lys   Lys    Lys                               FG3   91 Leu Leu    FG3    96 Leu                                                                              Leu   Leu    Leu                               FG4   92 Arg Arg    FG4    97 His                                                                              His   His    His                               FG5   93 Val Val    FG5    98 Val                                                                              Val   Val    Val                               G1    94 Asp Asp    G1     99 Asp                                                                              Asp   Asp    Asp                               G2    95 Pro Pro    G2    100 Pro                                                                              Pro   Pro    Pro                               G3    96 Val Val    G3    101 Glu                                                                              Glu   Glu    Glu                               G4    97 Asn Asn    G4    102 Asn                                                                              Asn   Asn    Asn                               G5    98 Phe Phe    G5    103 Phe                                                                              Phe   Phe    Phe                               G6    99 Lys Lys    G6    104 Arg                                                                              Arg   Lys    Lys                               G7   100 Leu Leu    G7    105 Leu                                                                              Leu   Leu    Leu                               G8   101 Leu Leu    G8    106 Leu                                                                              Leu   Leu    Leu                               G9   102 Ser Ser    G9    107 Gly                                                                              Gly   Gly    Gly                               G10  103 His His    G10   108 Asn                                                                              Asn   Asn    Asn                               G11  104 Cys Cys    G11   109 Val                                                                              Val   Val    Val                               G12  105 Leu Leu    G12   110 Leu                                                                              Leu   Leu    Met                               G13  106 Leu Leu    G13   111 Val                                                                              Val   Val    Val                               G14  107 Val Val    G14   112 Cys                                                                              Cys   Thr    Ile                               G15  198 Thr Thr    G15   113 Val                                                                              Val   Val    Ile                               G16  109 Leu Leu    G16   114 Leu                                                                              Leu   Leu    Leu                               G17  110 Ala Ala    G17   115 Ala                                                                              Ala   Ala    Ala                               G18  111 Ala Ala    G18   116 His                                                                              Arg   Ile    Thr                               G19  112 His Arg    G19   117 His                                                                              Asn   His    His                               GH1  113 Leu Phe    GH1   118 Phe                                                                              Phe   Phe    Phe                               GH2  114 Pro Pro    GH2   119 Gly                                                                              Gly   Gly    Gly                               GH3  115 Ala Ala    GH3   120 Lys                                                                              Lys   Lys    Lys                               GH4  116 Glu Asp    GH4   121 Glu                                                                              Glu   Glu    Glu                               GH5  117 Phe Phe    GH5   122 Phe                                                                              Phe   Phe    Phe                               H1   118 Thr Thr    H1    123 Thr                                                                              Thr   Thr    Thr                               H2   119 Pro Ala    H2    124 Pro                                                                              Pro   Pro    Pro                               H3   120 Ala Glu    H3    125 Pro                                                                              Gln   Glu    Glu                               H4   121 Val Ala    H4    126 Val                                                                              Met   Val    Val                               H5   122 His His    H5    127 Gln                                                                              Gln   Gln    Gln                               H6   123 Ala Ala    H6    128 Ala                                                                              Ala   Ala    Ala                               H7   124 Ser Ala    H7    129 Ala                                                                              Ala   Ser    Ala                               H8   125 Leu Trp    H8    130 Tyr                                                                              Tyr   Trp    Trp                               H9   126 Asp Asp    H10   131 Gln                                                                              Gln   Gln    Gln                               10   127 Lys Lys    H10   132 Lys                                                                              Lys   Lys    Lys                               H11  128 Phe Phe    H11   133 Val                                                                              Val   Met    Leu                               H12  129 Leu Leu    H12   134 Val                                                                              Val   Val    Val                               H13  130 Ala Ser    H13   135 Ala                                                                              Ala   Thr    Ser                               H14  131 Ser Val    H14   136 Gly                                                                              Gly   Gly, Ala                                                                              Ala                               H15  132 Val Val    H15   137 Val                                                                              Val   Val    Val                               H16  133 Ser Ser    H16   138 Ala                                                                              Ala   Ala    Ala                               H17  134 Thr Ser    H17   139 Asn                                                                              Asn   Ser    Ile                               H18  135 Val Val    H18   140 Ala                                                                              Ala   Ala    Ala                               H19  136 Leu Leu    H19   141 Leu                                                                              Leu   Leu    Leu                               H20  137 Thr Thr    H20   142 Ala                                                                              Ala   Ser    Ala                               H21  138 Ser Glu    H21   143 His                                                                              His   Ser    His                               HC1  139 Lys Lys    HC1   144 Lys                                                                              Lys   Arg    Lys                               HC2  140 Tyr Tyr    HC2   145 Tyr                                                                              Tyr   Tyr    Tyr                               HC3  141 Arg Arg    HC3   146 His                                                                              His   His    His                               ______________________________________                                    

                  TABLE 2: -                                                       Oligonucleotide Sequences Used to Create Di-Alpha Globin                       ______________________________________                                         Genes                                                                          α.sup.1 (Arg)GlyMet(Leu)α.sub.2 Linker                              ##STR10##                                                                     α.sup.1 (Arg)GlyGly(Val)α.sub.2 Linker                              ##STR11##                                                                     α.sup.1 (Arg))Gly(Val)α.sub.2 Linker                                ##STR12##                                                                     α.sup.1 (Arg)GlyGly(Leu)α.sub.2                                     ##STR13##                                                                     α.sup.1 (Arg)(Val)α.sub.2                                           ##STR14##                                                                     ______________________________________                                    

                                      TABLE 3                                      __________________________________________________________________________     Oligonucleotide Sequences to Create Beta Globin Mutations                      __________________________________________________________________________     β.sup.67 val-->ile                                                        Ncol                                                                            ##STR15##                                                                     CATTTTTTtagGACCCACGAAAGAGACTGCCAGACCGAGTGGACCTGTTGGACTTTC                      .sup.β67 val-->ile;.sup.β82 lys-->arg                                Ncol                                                                            ##STR16##                                                                     CATTTTTTtagGACCCACGAAAAGAGACTGCCAGACCAGTGGACCTGTTGGACgcaC                      β.sup.102 asn-->thr                                                       Sacl                                                                            ##STR17##                                                                     TCGAGGTGACGCTGTTTGACGTGCAACTGGGCCTTtggAAGGCAGACGACCCATTGCATGATC                __________________________________________________________________________

                  TABLE 4                                                          ______________________________________                                         P.sub.50 Values for Hemoglobin-Like Proteins (37° C.)                   Hemoglobin                                                                     ______________________________________                                         des--val Hgb         13.3                                                      dialpha (arg--gly--met) Hgb                                                                         9.0                                                       dialpha beta.sup.67 val-->ile Hgb                                                                   24.7                                                      dialpha (arg--gly--gly--val) Hgb                                                                    9.0                                                       ______________________________________                                    

                                      TABLE 5                                      __________________________________________________________________________     Synthetic Oligonucleotides Used For The Synthesis Of pGAP                      __________________________________________________________________________     1 TCGACTGAAA                                                                             AAAAAGGTTT                                                                             AAACCAGTTC                                                                             CCTGAAATTA                                                                             TTCCCCTACT                                     TGACTAATAA                                                                             GTATATAAAG                                                           2 CAATACCTAC                                                                             CGTTTATATA                                                                             CTTATTAGTC                                                                             AAGTAGGGGA                                                                             ATAATTTCAG                                     GGAACTGGTT                                                                             TAAACCTTTT                                                                             TTTTCAG                                                      3 ACGGTAGGTA                                                                             TTGATTGTAA                                                                             TTCTGTAAAT                                                                             CTATTTCTTA                                                                             AACTTCTTGA                                     ATTCTACTTT                                                                             TATAGTTAGT                                                                             CTTTTTTTTA                                                                             GTTTT                                                4 AAGTTCTTGG                                                                             TGTTTTAAAA                                                                             CTAAAAAAAA                                                                             GACTAACTAT                                                                             AAAAGTAGAA                                     AGAAGTTTAA                                                                             GAAATAGATT                                                                             TACAGAATTA                                                                             CAAT                                                 5 AAAACACCAA                                                                             GAACTTAGTT                                                                             TCGAATAAAC                                                                             ACACATAAAT                                                                             AAACCATGGT                                     TAACT                                                                        6 CTAGAGTTAA                                                                             CCATGGTTTA                                                                             TTTATGTGTG                                                                             TTTATTCGAA                                                                             ACT                                          __________________________________________________________________________

                                      TABLE 6                                      __________________________________________________________________________     Synthetic Oligonucleotides Used For The Synthesis                              Of The Galactose Upstream Activator                                            __________________________________________________________________________     1 CGTACGGATTAGAAGCCGCCGAGCGGGTGACAGCCCTCCGAAGGAAGACTC                            TCCTCCGTGCGTCCTCGTC TTCACCGGTCGC                                             2 AGGACGCACGGAGGAGAGTCTTCCTTCGGAGGGCTGTCACCCGCTCGGGG                             CTTCTAATCCGTACGCATG                                                          3 GTTCCTGAAACGCAGATGTGCCTCGCGCCGCACTGCTCCGAACAAT                                 AAAGATTCTACAATACTAGCTTTT ATGGTTATGAAGAGGAAAAT                                4 ATAACCATAAAAGCTAGTATTGTAGAATCTTTATTGTTCGGAGCAGTGCG                             GCGCGAGGCACATCTGCGTT TCAGGAACGCGACCGGTGAAGAC                                 5 TGGCAGTAACCTGGCCCCACAAACCTCAAATGAACGAAATCAAATTA                                ACAACCAGATATC                                                                6 TCGAGATATCTGGTTGTTAATTTGATTCGTTCATTTGAGGTTTGTGG                                GGCCAGGTTACTGCCAATTTTCCTCTTC                                                 __________________________________________________________________________

                  TABLE 7                                                          ______________________________________                                         Codon Preferences in Yeast                                                     ______________________________________                                         Ala            GCU, GCC                                                        Ser            UCU, UCC                                                        Thr            ACU, ACC                                                        Val            GUU, GUC                                                        Ile            AUU, AUC                                                        Asp            GAC                                                             Phe            UUC                                                             Tyr            UAC                                                             Cys            UGU                                                             Asn            AAC                                                             His            CAC                                                             Arg            AGA                                                             Glu            GAA                                                             Leu            UUG                                                             Lys            AAG                                                             Gly            GGU                                                             Gln            CAA                                                             Pro            CCA                                                             Met            AUG (No alternative codons)                                     Trp            UGG (No alternative codons)                                     ______________________________________                                          Source: Bennetzen and Hall, J. Biol. Chem., 257:3026-31 (1982).          

                  TABLE 8                                                          ______________________________________                                         OLIGONUCLEOTIDES USED FOR THE CONSTRUCTION OF A                                GENE ENCODING A TANDEM ALPHA-GLOBIN DIMER.                                     ______________________________________                                         AL-1SS                                                                         5'-tgcacgcttctttggacaagttcttggcttctgtttctactgtgttaactagtaagt                   acagaggtggtgttttgtctcctgcagacaagactaac-3'                                      AL-2SS                                                                         5'-gttaaggctgcttggggtaaggttggtgctcacgctggtgaatacggtgctgaagcttt                 ggaaaggatgttcttgtct-3'                                                         AL-1AS                                                                         5'-tgcaggagacaaaacaccacctctgtacttactagttaacacagtagaaacagaagcca                 agaacttgtccaaagaagcg-3'                                                        AL-2AS                                                                         5'-ggaaagacaagaacatcctttccaaagcttcagcaccgtattcacccagcgtgagcacc                 aaccttaccccaagcagccttaacgttagtcttgtc-3'                                        ______________________________________                                          Note: NcoIALPHA-1-ApaL1; FoK1ALPHA-2-Sal1; ApaLIRGGV-Fok1                

                  TABLE 9                                                          ______________________________________                                         Oxygen Affinity of Recombinant Mutant Hemoglobin                               Generated From FX-Hgb and Tryptic Digestion                                    Oxygen affinity measured at 37° C. in 50 mM Bis-Tris                    (as opposed to 50 mM HEPES buffer) pH 7.4, 0.1M NaCl on a                      Hemox-Analyzer. Solutions were 60 uM in heme and measured                      between 130 and 1.2 torr oxygen tension.                                                       P.sub.50                                                       ______________________________________                                         HgB A.sub.o       9.5                                                          rHgB A.sub.o      9.2                                                          HgB Providence    10.2                                                         Hgb Kansas        11.3                                                         HgB (beta.sup.67 val-->ile)                                                                      22.4                                                         ______________________________________                                          The value for Hgb A.sub.o is of course for free hemoglobin in solution.        The P.sub.50 of whole blood is much higher.                              

                  TABLE 10                                                         ______________________________________                                         Effects of NaCl and Inositol Hexaphosphate on Oxygen                           Binding to Hemoglobin A.sub.o and Recombinant des-Fx Hgb                              P.sub.50      P.sub.50                                                         0.1M NaCl                                                                              0M NaCl   2.2 mM IHP                                                                               0M 1HP                                      ______________________________________                                         Hgb A.sub.o                                                                             6.6       2.8       51.1    6.6                                       des-Fx Hgb                                                                              4.9       3.9       5.5     4.9                                       ______________________________________                                    

                  TABLE 11                                                         ______________________________________                                         Distribution of FX-Alpha, FX-Beta and FX-Hgb in E. coli.                                    milligrams of protein                                                          per OD-liter of E. coli                                                        Soluble                                                                              Insoluble                                                   ______________________________________                                         FX-Alpha       36      0                                                       FX-Beta        21      21                                                      FX-Hgb         188     0                                                       ______________________________________                                    

                  TABLE 100                                                        ______________________________________                                         FERM DATA                                                                                    IN-                            RHGB                                            DUCT    FINAL MG DI-                                                                               MG/  MG    MG/                               RFE  STRAIN   TEMP    OD    A + B OD-L RHGB  OD-L                              ______________________________________                                          1   BL21     30.0    6.6              0.0   0.0                                5   BL21HA   37.0    9.3              0.0   0.0                                6   JM109    37.0    7.1   234.0 16.5 5.5   0.6                                7   JM109    37.0    11.3  372.0 16.5 25.0  1.1                                8   JM109    37.0    6.6   442.0 33.5 49.0  3.7                               10   SCS1     37.0    15.2  754.0 24.8 38.0  1.2                               11   JM109    25.0    11.5             26.0  1.1                               12   JM109    25.0    9.5              31.0  1.6                               13   JM109    30.0    9.0              64.0  3.5                               17   JM109    37.0    14.4             51.2  1.8                               25   JM110    30.0    20.5             25.0  0.6                               26   LE392    37.0    23.0  1040.0                                                                               22.6 4.1   0.1                               27   JM110    30.0    27.0  334.0 6.2  47.0  0.9                               30   JM110    37.0    23.6             16.0  0.3                               31   23722    30.0    25.4             59.0  1.2                               35   W3110    30.0    18.6             13.0  0.3                               38   AG-1     30.0    13.0             57.0  2.1                               49   DH-1     30.0    4.0              7.7   0.9                               50   NM554    30.0    14.0             56.0  2.0                               51   NM554    37.0    14.5             18.0  0.6                               ______________________________________                                    

                  TABLE 200                                                        ______________________________________                                         Bacterial and Yeast Vectors                                                    DEFINITIONS:                                                                   ______________________________________                                         ROP:          Gene which regulates plasid copy number                          ROP+:         Low copy number                                                  ROP-:         High copy number                                                 AR:           ampicillin resistance used for plasmid                                         selection                                                        TR:           tetracycline resistance used for plasmid                                       selection                                                        TS:           tetracycline sensitive, TR gene not                                            functional                                                       E4            tetracycline resistance used for plasmid                                       selection                                                        E5            tetracycline sensitive, TR gene not                                            functional                                                       FX-A:         FX-alpha globin gene                                             FX-B:         FX-beta globin gene                                              DFX-A:        Des-FX alpha globin gene                                         DFX-B:        Des-FX beta globin gene                                          DV-A:         Des-Val alpha globin gene                                        DV-B:         Des-Val beta globin gene                                         RV-Di-alpha:  Di-alpha gene containing no amino acid                                         spacer (R = Arginine; V--Valine)                                 RGV-Di-alpha: ADi-alpha globin gene containing a                                             single glycine (G) linker followed by a                                        valine                                                           RGM-Di-alpha: Di-alpha globin gene containing a                                              single glycine (G) linker followed by a                                        methionine (M)                                                   RGGV-Di-alpha:                                                                               Di-alpha globin gene containing a two                                          glycine (G) linker followed by a                                               valine                                                           RGGGV-Di-alpha:                                                                              Di-alpha globin gene containing a                                              three glycine (G) linker followed by                                           a valine                                                         RPV-di-alpha  Di-alpha gene fused by a single proline                                        (P) (yeast only)                                                 LACI:         gene encoding a repressor which regulates                                      the TAC promoter                                                 LAC+:         Lac repressor gene on plasmid                                    LAC-:         no Lac repressor gene on plasmid                                 ______________________________________                                    

All bacterial plasmids listed below which contain alpha, di-alpha and/or beta genes also have translational couplers. The pPL expression system translationally couples the lambda N protein gene to a globin gene. The designations "E4" and "E5" appear in the names of certain plasmids, and have the meanings given above.

pKK223-3

Parental plasmid obtained commercially from Pharmacia LKB, 800 Centennial Ave., P.O. Box 1327 Piscataway, N.J. 08855-1327; all other plasmids derived from this parental plasmid. pKK223-3 has a TAC promoter followed by a poly-restriction site region to facilitate gene insertion.

AR, TS, ROP+, LAC-

1. pDL II-62m

pKK223-3 containing FX-A

AR, TS, ROP+, LAC-

2. pDL II-10a

pKK223-2 containing FX-B

AR, TS, ROP+, LAC-

3. PDL II-66A

Parental plasmids are (1) and (2); contains both FX-A and FX-B in single operon

AR, TS, ROP+, LAC-

4. PGEM FX-A

Parental plasmids are (1) and pGem1 which is commercially available from Promega Corporation, 2800 Woods Hollow Rd., Madison, Wis. 53711.

5. pGEM FX-B

pGEM containing FX-B, AR Parental plasmids are (2) and pGEM1

5a. pGEM di-alpha

Parentals are pGEM1 (see discussion of #4) and (2). The SmaI/PstI fragment of (29), containing the di-alpha gene, is excised and ligated into SmaI/PstI-cut pGEM1.

6. pDL II-83a

Parental is 4; contains DFX-A

7. pDL III-6f

Parental is 5; contains DFX-B, AR

8. pDL II-86c

Parentals are pKK223-3 and (6), contains DFX-A,

AR, TS, ROP+, LAC-

9. pDL III-13e

Parentals are (7) and (8), pKK223-3 containing both DFX-A and DFX-B,

AR, TS, ROP+, LAC-

10. pDL II-91f

Parental is (4) contains DV-A, AR

11. pDL II-95a

Parental is (7) contains DV-B, AR

12. pDL III-1a

Parentals are pKK223-3 and (10), contains DV-A

AR, TS, ROP+, LAC-

13. pDL III-14c

Parentals are (11) and (12), contains DV-A and DV-B,

AR, TS, ROP+, LAC-

13a. pDL III-38b

Parentals are (11) and (23)

AR, TS, ROP+, LAC-

14. pDL III-47a

Parental is (13), contains RGM-di-alpha and DV-B,

AR, TS, ROP+, LAC-

15. pDL III-82A

Parental is (13), contains RGGV-di-alpha and DV-B,

AR, TS, ROP+, LAC-

16. pDL IV-8a

Parental is (13), contains RGV-di-alpha and DV-B,

AR, TS, ROP+, LAC-

17. pDL IV-47b

Parental is (13), contains RV-di-alpha and DV-B

AR, TS, ROP+, LAC-

18. pDL IV-66a

Parental is (13), contains RGGGV-di-alpha and DV-B,

AR, TS, ROP+, LAC-

19. pDL IV-3a

Parental is (15), ROP gene is inactivated by insertion of a Not I linker into the PvuII site within the ROP gene

AR, TS, ROP-, LAC-

20. pDL IV-38a

Parental is (15), contains the Nagai mutation in DV-B,

AR, TS, ROP+, LAC-

21. pDL IV-58f

Parental is (20), ROP gene inactivated as in (19),

AR, TS, ROP-, LAC-

22. pDL IV-59a

Parental is (21) and pBR322, which is commercially available from a number of different suppliers. Contains a functional TR gene constructed in the following manner:

The EcoR1 site of pBR322 was changed to a BamH1 linker. This permitted the removal of the 5' end 5' end of the TR gene from pBR322 as a BamH1 fragment. This fragment was then inserted into the BamH1 site located at the junction of the TAC promoter and the inactive TR gene of pKK223-3. Insertion of this fragment reactivates the TR gene if the fragment inserts in the proper orientation. Selection of colonies on tetracycline plates assures presence of the fragment in the proper orientation.

AR, TR, ROP-, LAC-

23. pJR V-83a

Parental is (11), contains DV-B with the Presbyterian mutation (asn108->lys),

AR, TS, ROP+, LAC-

24. pJR VI-29a

Parentals are (15) and (23), contains RGGV-di-alpha and DV-B with Presbyterian mutation,

AR, TS, ROP+, LAC-

25. pJR VI-53b

Parental is (24), made TR by insertion of BAMH1 fragment,

AR, TR, ROP+, LAC-

26. pJR VI-61a

Parental is (25), made ROP- by insertion of Not I linker into PvuII site

AR, TR, ROP-, LAC-

27. pDL V-4a

Parentals are (16) and (26), contains RGV-di-alpha and DV-B with Presbyterian mutation AR, TS, ROP-, LAC-

28. pDL V-10a

Parental is (27), insertion of BamH1 fragment to convert to TR

AR, TR, ROP-, LAC-

29. pDL V-16d (also named pSGE1.1E4 or simply pSGE1.l)

Parental is (28), contains LACI gene inserted into Not I site. LACI gene obtained using following protocol:

Polymerase chain reaction (PCR) primers containing NOTI sites at their 5' ends were used to amplify the LACI gen. Following gel purification, the gene was inserted into the NOT I site of pDL V-1a

AR, TR, ROP-, LAC+

Several other plasmid constructs have been designed to facilitate the incorporation of a second beta globin gene under regulation of its own TAC promoter.

30. pDL IV-64a

Parental is (14), contains beta globin under regulation of a synthetic TAC promoter AR, TS, ROP+, LAC-

31. pDL IV-67a

Parental plasmids are (14) and (30), contains di-alpha under regulation of one pTAC, and DV-B under regulation of a second pTAC, DV-B is adjacent to di-alpha

AR, TS, ROP-, LAC-

32. pJR VI-54a

Parental plasmids are (14) and (30), contains di-alpha and DV-B under regulation of one pTAC and a second DV-B under regulation of another pTAC. The second DV-B is inserted into the Pvu II site of the plasmid

AR, TS, ROP-, LAC-

33. pPL Lambda,

Commercially available plasmid from Pharmacia LKB (see above); contains pL promoter and coding region for N protein of lambda which can be used for expression of fusion or translationally coupled recombinant proteins.

34. pPL-alpha/beta

Parental plasmids are (13) and (33), contains DV-A and DV-B

AR, ROP+

35. pPL-dialpha/beta

Parental is (34), contains RGV-di-alpha and DV-B

AR, TS, ROP+, LAC-

36. pSGE0.11-L0

Parental plasmid is (35), ROP gene inactivated by insertion of NotI linker into PvuII site in ROP gene

AR, ROP-

37. pSGE0.1E4

Parentals are pSGE1.1E4 (29) and pDL II-95a (11), contains di-alpha globin gene followed by wild type beta globin gene. The wild type beta globin gene from pDL II-95a PstI and HindIII, gel purified, and ligated into pSGE1.1E4 from which the beta Presbyterian gene had been removed with the same restriction enzymes.

38. pSGE0.0E4

Parentals are pSGE0.1 (37) and pDL II-91f (10), contains di-alpha globin gene followed by wild type beta globin gene. The alpha globin gene from pDL II-91f was excised with SmaI and PstI, gel purified, and ligated into pSGE0.1E4 from which the di-alpha globin gene removed using the same restriction enzymes.

39. pPL lambda+TC

Parental is pPL lambda (33). pPL lambda was linearized with the restriction enzyme HpaI and synthetic oligonucleotides encoding the translational coupler region, and a portion of the 5' coding region of alpha globin ligated into the HpaI site.

40. pSGE1.0E4

Parentals are pSGE1.1E4 (29) and pSGE0.0E4 (38), contains a single alpha globin followed by beta globin gene containing the Presbyterian mutation. pSGE0.0E4 was digested with SmaI and PstI, the alpha globin gene gel purified, and ligated into pSGE1.1E4 from which the di-alpha gene had been removed with the same restriction enzymes.

41. pSGE223

Parental is commercially available pGEM-1. NheI restriction site has been eliminated by digestion with NheI, T4 polymerase filing, and ligation.

42. pSGE224

pSGE223 containing the beta Presbyterian gene.

Parentals are pSGE223 (41) and pSGE1.1E4 (29)

43. pSGE229

pSGE224 containing the 5 glycine di-beta linker sequences

Parental plasmid is pSGE224 (42) The linker sequences are synthetic DNA.

44. pSGE234

contains the di-beta globin gene. Parentals are pSGE224 (42) and pSGE229 (43). The NheI fragment from pSGE229 contains the beta Presbyterian gene and the 5 glycine di-beta linker was ligated into NheI digested pSGE224.

45. pSge1.1E5

Parental is pSGE1.1E4 (29). 5' end of the tet R gene was removed by digestion with BamHI.

45a. pSGE0.0E5

Parental is pSGE0.0E4 (38). BamHI fragment encoding 5' end of tet R gene has been removed.

46. pSGE1.0E5

single alpha globin gene followed by beta Presbyterian gene. Parentals are pSGE0.0E5 (45a) and pJRV-83a (23). The beta globin gene containing the Presbyterian mutation was excised from (23) and ligated into (45a), from which the wild type beta globin gene had been removed with the same two restriction enzymes.

47. pSGE1.05E5

single alpha globin gene followed by 5 glycine linked di-beta gene. Parentals are pSGE234 (44) and pSGE1.0E5 (46). The PstI into HindIII fragment from pSGE234 containing the di-beta gene was ligated into pSGE1.0E5 from which the single beta globin gene had been removed using the same restriction enzymes.

48. pSGE1.05E4

Parentals are pSGE1.05E5 (47) and pSGE1.1E4 (29). The BamHI fragment encoding the 5' end of the tet R gene from pSGE1.1E4 was ligated into the BamHI site of pSGE1.05E5.

YEAST VECTORS

49. pSK+

Commercially available from Stratagene, LaJolla, Calif.

50. pGS2488

Derived from (49) by insertion of synthetic GAP 491 transcriptional initiation site.

51. pGS2888

Derived from (50) by conversion of KpnI site to SphI site.

52. pGS4788

Derived from (51) by insertion of synthetic GAL_(UAS) into (51) to form GALGAP hybrid promoter.

53. pLC IIFX-β-globin

Plasmid available from Kiyoshi Nagai, Medical Research Council, London, England; bears β globin gene

54. pUC19

Plasmid commercially available from Bethesda Research Laboratories, Gaithersburg, Md.

55. pSUC2-6Σ

Plasmid described by Stetler, et al., Biotechnology, 7:55-60 (1989)

56. pUC19β-globin

Derived from (53) and (54)

57. pGS1188

β-globin gene (from 56) is under control of the sigma promoter and the MFalpha terminator both from (55).

58. pGS3588

Derived from (57) and (52); sigma promoter replaced by GALGAP promoter of (52).

59. pGS3888

Derived from (58) by conversion of SmaI site to XhoI site.

60. pα-MRC

Plasmid available from K. Nagai, MRC; bears alpha-globin gene.

61. pGS4088

Derived from (59) by insertion of α-globin gene, replacing the β-globin gene.

62. pSN(+)

Derived from (49) by changing KpnI site to NotI site.

63. pGS4888

Derived from (62) by insertion of pGGAP-α-globin expression cassette from (48).

64. pGS189

Derived from (63) by insertion of pGGAP-β-globin expression cassette from (47). Plasmid thus bears both GALGAP α globin and GALGAP β globin genes.

65. pC1U

Plasmid described by Stetler, et al.; Biotechnology, 7:55-60 (1980)

66. PC1N

Derived from (65) by addition of NotI site.

67. pGS289

Derived from (66) and (64) by insertion of α-globin and β-globin expression cassettes.

68. pGS389

Same as (67), but with insert orientation reversed.

69. pYRp7

Plasmid described in Strathern, et al., The Molecular Biology of the Yeast Saccharomyces (Cold Spring Harbor, 1981); source of TRP1 gene.

70. pC1T

Derived from (65) and (69) by replacing Ura3 gene (65) with Trp1 gene (69).

71. pGS4988

Derived from (70) by insertion of β-globin expression cassette from (59).

72. pGS4488

Derived from (65) by insertion of α-globin expression cassette from (61).

73. pGS4688

Same as (72), but with insert orientation reversed.

74. pGS4888

Derived from (62) by insertion of GGAP-alpha globin expression cassette from (61).

75. pGS1889

Derived from (74) by removal of PstI site.

76. pGS1989

Derived from (75) by removal of SpeI site.

77. pGS2189

Derived from (74) and (76); encodes di-alpha globin with RGGV linker.

78. pGS2989

Derived from (77) and (60), contains di-alpha globin gene and beta globin gene.

79. pGS3089

Derived from (66) by insertion of expression cassette from (78).

80. Phagescript

Phage commercially available from Stratagene.

81. Phage Mpβ-globin

Derived from (80) by insertion of GALGAP promoter and β-globin gene from (78).

82. pGS3089 RGV desβ

Derived from (79) by deletion of XhoI fragment containing β-globin expression cassette.

83. pGS3889

(79) with Presbyterian (βN108K) mutation.

84. pGS5189

(79) with Agenogi (βE90K) mutation.

85. pGS5689

(79) with Kansas (βN102T) mutation.

86. pGS4989

(79) with βV67I mutation.

87. pGS2989 RPV

(77) with RPV di-alpha linker

88. pGS2989 RGV

(77) with RGV di-alpha linker

89. pGS3089

(66) with Hgb expression cassette from (88)

90. pGS3089 RPV

(66) with Hgb expression cassette from (87)

                  TABLE 300                                                        ______________________________________                                         TABLE OF BACTERIAL STRAINS                                                            Strain   Availability Expression                                        ______________________________________                                         1        BL21       Brookhaven   TAC                                           2        JM109      Commercial   TAC                                           3        SCS1       Commercial   TAC                                           4        JM110      ATCC         TAC                                           5        LE392      ATCC         TAC                                           6        23722      ATCC         TAC                                           7        W3110      ATCC         TAC                                           8        AG-1       Commercial   TAC                                           9        DH1        ATCC         TAC                                           10       NM554      Commercial   TAC                                           11       N99cl+     Commercial   PL                                            12       N4830-1    Commercial   PL                                            ______________________________________                                    

                  TABLE 400                                                        ______________________________________                                         HIGH OXYGEN AFFINITY, NATURALLY OCCURRING                                      HEMOGLOBIN MUTANTS                                                             Structure           Name                                                       ______________________________________                                         A. Alpha Chain Mutants                                                          6 (A4)    Asp-->Ala    Sawara                                                            Asp-->Asn    Dunn                                                              Asp-->Val    Ferndown                                                          Asp-->Tyr    Woodville                                                         Lys-->Asn    Albany-Suma                                             40 (C5)   Lys-->Glu    Kariya                                                  44 (CE2)  Pro-->Leu    Milledgeville                                                     Pro-->Arg    Kawachi                                                 45 (CE3)  His-->Arg    Fort de France                                          85 (F6)   Asp-->Asn    G-Norfolk                                               92 (FG4)  Arg-->Gln    J-Cape Town                                                       Arg-->Leu    Chesapeake                                              95 (G2)   Pro-->Leu    G-Georgia                                                         Pro-->Ser    Rampa                                                             Pro-->Ala    Denmark Hill                                                      Pro-->Arg    St. Luke's                                              97 (G4)   Asn-->Lys    Dallas                                                 126 (H9)   Asp-->Asn    Tarrant                                                141 (HC3)  Arg-->His    Suresnes                                                          Arg-->Ser    J-Cubujuqui                                                       Arg-->Leu    Legnano                                                B. Beta Chain Mutants                                                           2 (NA2)   His-->Arg    Deer Lodge                                                        His-->Gln    Okayama                                                 20 (B2)   Val-->Met    Olympia                                                 23 (B5)   Val-->Asp    Strasbourg                                                        Val-->Phe    Palmerston North                                        34 (B16)  Val-->Phe    Pitie-Salpetriere                                       36 (C2)   Pro-->Thr    Linkoping                                               37 (C3)   Trp-->Ser    Hirose                                                  40 (C6)   Arg-->Lys    Athens-Ga                                                         Arg-->Ser    Austin                                                  51 (D2)   Pro-->Arg    Willamette                                                        Leu-->His    Brisbane                                                79 (EF3)  Asp-->Gly    G-Hsi-Tsou                                                        Lys-->Thr    Rahere                                                            Lys-->Met    Helsinki                                                89 (F5)   Ser-->Asn    Creteil                                                           Ser-->Arg    Vanderbilt                                              94 (FG1)  Asp-->His    Barcelona                                                         Asp-->Asn    Bunbury                                                 96 (FG3)  Leu-->Val    Regina                                                  97 (FG4)  His-->Gln    Malmo                                                             His-->Leu    Wood                                                    99 (G1)   Asp-->Asn    Kempsey                                                           Asp-->His    Yakima                                                            Asp-->Ala    Radcliffe                                                         Asp-->Tyr    Ypsilanti                                                         Asp-->Gly    Hotel-Dieu                                                        Asp-->Val    Chemilly                                               100 (G2)   Pro-->Leu    Brigham                                                101 (G3)   Glu-->Lys    British Columbia                                                  Glu-->Gly    Alberta                                                           Glu-->Asp    Potomac                                                103 (G5)   Phe-->Leu    Heathrow                                               109 (G11)  Val-->Met    San Diego                                              121 (GH4)  Glu-->Gln    D-Los Angeles                                                     Pro-->Gln    Tu Gard                                                           Ala-->Pro    Crete                                                  140 (H18)  Ala-->Thr    St.-Jacques                                            142 (H20)  Ala-->Asp    Ohio                                                   143 (H21)  His-->Arg    Abruzzo                                                           His-->Gln    Little Rock                                                       His-->Pro    Syracuse                                               144 (HC1)  Lys-->Asn    Andrew-Minneapolis                                     145 (HC2)  Tyr-->His    Bethesda                                                          Tyr-->Cys    Rainier                                                           Tyr-->Asp    Fort Gordon                                                       Tyr-->Term   McKees Rocks                                           146 (HC3)  His-->Asp    Hiroshima                                                         His-->Pro    York                                                              His-->Leu    Cowtown                                                ______________________________________                                    

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 7                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 37 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (ix) FEATURE:                                                                  (A) NAME/KEY: Modified-site                                                    (B) LOCATION: 1                                                                (D) OTHER INFORMATION: /label= AA- 1                                           /note= "Lys modified as Ac-lys"                                                (ix) FEATURE:                                                                  (A) NAME/KEY: Modified-site                                                    (B) LOCATION: 37                                                               (D) OTHER INFORMATION: /label= AA- 37                                          /note= "Leu is modified as Leu-amide"                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        LysCysAlaGluLeuGluGlyArgLeuGluAlaLeuGluGlyArgLeu                               151015                                                                         GluAlaLeuGluGlyArgLeuGluAlaLeuGluGlyArgLeuGluAla                               202530                                                                         LeuGluGlyLysLeu                                                                35                                                                             (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 16 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        GlyGluLeuGluGluLeuLeuLysLysLeuLysGluLeuLeuLysGly                               151015                                                                         (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 12 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        LeuArgArgGlnIleAspLeuGluValThrGlyLeu                                           1510                                                                           (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 35 amino acids                                                     (B) TYPE: amino acid                                                           (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        LysCysAlaGluLeuGluGlyLysLeuGluAlaLeuGluGlyLysLeu                               151015                                                                         GluAlaLeuGluGlyLysLeuGluAlaLeuGluGlyLysLeuGluAla                               202530                                                                         LeuGluGly                                                                      35                                                                             (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 27 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        CCGAACGCGTTGTGCGCTCTGTCTGAT27                                                  (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        GGTGCTCACGTTGATTGCATGCCGAACGCG30                                               (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        CTGACCAACGCTGTTTGCCACGTTGATGAT30                                               __________________________________________________________________________ 

We claim:
 1. A non-naturally occurring, genetically fused globin-like polypeptide comprising two or more globin-like domains connected by one or more peptide bonds into a single unbranched polypeptide chain, said chain being capable of associating with alpha and/or beta globin and incorporating heme to form a hemoglobin-like protein with reversible oxygen binding activity.
 2. The polypeptide of claim 1 which comprises only two globin-like domains.
 3. The polypeptide of claim 2, said polypeptide being a di-alpha globin-like polypeptide comprising a first and second alpha globin-like polypeptide domains, said chain being capable of associating with beta globin.
 4. The polypeptide of claim 2, said polypeptide being a di-beta globin-like polypeptide comprising an essentially first and second beta-globin-like polypeptide domains, said chain being capable of associating with alpha-globin activity.
 5. The polypeptide of claim 2, said polypeptide being a pseudodimeric globin-like polypeptide comprising an alpha globin-like polypeptide domain and a beta globin-like polypeptide domain.
 6. The polypeptide of claim 1 wherein a first and a second globin-like polypeptide domain are connected by a peptide linker of one or more amino acids.
 7. The polypeptide of claim 6 wherein the amino acids of the linker are selected from the group consisting of lysine, aspartic acid, arginine, serine, asparagine, proline and glycine.
 8. The polypeptide claim 7 wherein the linker consists of one or more glycines.
 9. The polypeptide of claim 1 wherein each globin-like domain has at least 75% sequence identity with human alpha globin or human beta globin.
 10. The polypeptide of claim 1 wherein each globin-like domain is identical in sequence with human alpha globin or human beta globin.
 11. The polypeptide of claim 9, wherein said polypeptide comprises only two globin-like domains.
 12. The polypeptide of claim 1 wherein at a predetermined position, a first globin-like domain has a cysteine residue and a second globin-like domain has a residue other than cysteine.
 13. The polypeptide of claim 12 wherein each globin-like domain has at least 75% sequence identity with human alpha globin or human beta globin.
 14. The polypeptide of claim 13 wherein said globin-like domains of said polypeptide are identical except for that, at one or more predetermined positions, a first globin-like domain has a cysteine residue and a second globin-like domain has a residue other than cysteine.
 15. The polypeptide of claim 14 wherein one of said predetermined positions is(a) a position in the alpha globin-like domain corresponding to a human alpha globin position selected from the group consisting of his72, asn78, asn68, ala7l, thr67, lys7, lys11, thr8, ala12, thr118, lys16, ala45, glu116, gly15, his112, thr24, glu23, lys60, lys56, his50, gly51, glu53, ser49, asp47, gln54, his45, lys90, ala82, lys6l, ala19, his20, asp85, ser81, asp75, asp74, lys139, asp64, and gly18, or (b) a position in a beta globin-like domain corresponding to a human beta globin position selected from the group consisting of asp79, his2, leu3, thr4, glu6, ser9, thr12, ala13, gly16, lys17, val18, asn19, val20, asp21, glu22, lys65, ser72, ala76, his77, asp79, asn80, gly83, ala86, thr87, glu90, lys95, lys59, glu43, ser44, asp47, ser49, thr50, ala53, asp52, lys61, glu121, lys120, thr123, lys66, asp73, ala62, his116, and his117.
 16. The polypeptide of claim 1 wherein said polypeptide contains one or more mutations rendering said protein a low affinity hemoglobin mutant.
 17. The polypeptide of claim 16 wherein at least one of said low affinity mutations is(a) a mutation of an alpha-globin mutation selected from the group consisting of 43 phe→leu, 43 phe→val, 86 leu→arg, 94 asp→asn, 46 phe→thr, 46 phe→ser, 46 phe→ala, 58 his→phe, 58 his→trp, 61 lys→thr, 61 lys→ser, 61 lys→met, 61 lys→asn, 62 val→leu, 62 val→ile, 62 val→phe, 62 val→trp, 65 ala→asp, 94 asp→gln, 94 asp→thr, 94 asp→ser, 94 asp→lys, 94 asp→gly, and 94 asp→arg, or (b) a mutation of a beta globin-like domain which corresponds to a human beta globin mutation selected from the group consisting of 1 val→acetyl ala, 21 asp→gly, 24 gly→asp, 37 trp→arg, 38 thr→pro, 42 phe→ser, 42 phe→leu, 42 phe→val, 45 phe→ser, 48 leu→arg, 61 lys→met, 65 lys→gln, 66 lys→thr, 67 val→asp, 70 ala→asp, 73 asp→tyr, 73 asp→asn, 73 asp→val, 82 lys→thr, 83 gly→asp, 90 glu→gly, 90 glu→lys, 91 leu→arg, 102 asn→thr, 102 asn→ser, 102 asn→tyr, 102 asn→lys, 107 gly→arg, 108 asn→asp, 108 asn→lys, 111 val→phe, 113 val→glu, 136 gly→asp, 21 asp→ala, 21 asp→ser, 45 phe→ala, 45 phe→thr, 45 phe→val, 63 his→phe, 63 his→trp, 66 lys→ser, 66 lys→asn, 67 val→phe, 67 val→trp, 67 val→ile, 70 ala→glu, 70 ala→ser, 70 ala→thr, 96 leu→phe, 96 leu→his, 96 leu→lys, 98 val→trp, 98 val→phe, 102 asn→asp, 102 asn→glu, 102 asn→arg, 102 asn→his, 102 asn→gly, 108 asn→arg, and 108 asn→glu.
 18. The polypeptide of claim 1, wherein said polypeptide contains one or more mutations rendering said protein a high affinity hemoglobin mutant.
 19. The polypeptide of claim 18 wherein at least one of said high affinity mutations is(a) a mutation of an alpha globin-like domain which corresponds to a human alpha globin mutation selected from the group consisting of 6 Asp→Ala, 6 Asp→Asn, 6 Asp→Val, 6 Asp→Tyr, 6 Lys→Asn, 40 Lys→Glu, 44 Pro→Leu, 44 Pro→Arg, 45 His→Arg, 85 Asp→Asn, 92 Arg→Gln, 92 Arg→Leu, 95 Pro→Leu, 95 Pro→Ser, 95 Pro→Ala, 95 Pro→Arg, 97 Asn→Lys, 126 Asp→Asn, 141 Arg→His, 141 Arg→Ser, and 141 Arg→Leu, or (b) a mutation of a beta globin-like domain which corresponds to a human beta globin mutation selected from the group consisting of 2 His→Arg, 20 Val→Met, 23 Val→Asp, 23 Val→Phe, 34 Val→Phe, 36 Pro→Thr, 37 Trp→Ser, 40 Arg→Lys, 40 Arg→Ser, 51 Pro→Arg, 51 Leu→His, 79 Asp→Gly, 79 Lys→Thr, 79 Lys→Met, 82 Lys→Asn, 89 Ser→Asn, 89 Ser→Arg, 94 Asp→His, 94 Asp→Asn, 96 Leu→Val, 97 His→Gln, 97 His→Leu, 99 Asp→Asn, 99 Asp→His, 99 Asp→Ala, 99 Asp→Tyr, 99 Asp→Gly, 99 Asp→Val, 100 Pro→Leu, 101 Glu→Lys, 101 Glu→Gly, 101 Glu→Asp, 103 Phe→Leu, 109 Val→Met, 121 Glu→Gln, 121 Pro→Gln, 121 Ala→Pro, 140 Ala→Thr, 142 Ala→Asp, 143 His→Arg, 143 His→Gln, 143 His→Pro, 144 Lys→Asn, 145 Tyr→His, 145 Tyr→Cys, 145 Tyr→Asp, 145 Tyr→Term, 146 His→Asp, 146 His→Pro, and 146 His→Leu.
 20. The polypeptide of claim 1 wherein each alpha globin-like domain of said genetically fused polypeptide is identical to human alpha globin save for(a) one or more an oxygen affinity-modifying modifications characterized as(i) one or more low affinity mutations, selected from the group consisting of 43 phe→leu, 43 phe→val, 86 leu→arg, 94 asp→asn, 46 phe→thr, 46 phe→ser, 46 phe→ala, 58 his→phe, 58 his→trp, 61 lys→thr, 61 lys→ser, 61 lys→met, 61 lys→asn, 62 val→leu, 62 val→ile, 62 val→phe, 62 val→trp, 65 ala→asp, 94 asp→gln, 94 asp→thr, 94 asp→ser, 94 asp→lys, 94 asp→gly, and 94 asp→arg, or (ii) one or more high affinity mutations selected from the group consisting of 6 Asp→Ala, 6 Asp→Asn, 6 Asp→Val, 6 Asp→Tyr, 6 Lys→Asn, 40 Lys→Glu, 44 Pro→Leu, 44 Pro→Arg, 45 His→Arg, 85 Asp→Asn, 92 Arg→Gln, 92 Arg→Leu, 95 Pro→Leu, 95 Pro→Ser, 95 Pro→Ala, 95 Pro→Arg, 97 Asn→Lys, 126 Asp→Asn, 141 Arg→His, 141 Arg→Ser, and 141 Arg→Leu; and/or (b) a Cys substitution at one or more sites selected from the group consisting of: his72, asn78, asn68, ala71, thr67, lys7, lys11, thr8, ala12, thr118, lys16, ala45, glu116, gly15, his112, thr24, glu23, lys60, lys56, his50, gly51, glu53, ser49, asp47, gln54, his45, lys90, ala82, lys61, ala19, his20, asp85, ser81, asp75, asp74, lys139, asp64, and gly18.
 21. The polypeptide of claim 1 wherein the beta globin-like domain of said genetically fused polypeptide is identical to human beta globin save for(a) one or more oxygen affinity-modifying modifications characterized as(i) one or more low affinity mutations selected from the group consisting of, 1 val→acetyl ala, 21 asp→gly, 24 gly→asp, 37 trp→arg, 38 thr→pro, 42 phe→ser, 42 phe→leu, 42 phe→val, 45 phe→ser, 48 leu→arg, 61 lys→met, 65 lys→gln, 66 lys→thr, 67 val→asp, 70 ala→asp, 73 asp→tyr, 73 asp→asn, 73 asp→val, 82 lys→thr, 83 gly→asp, 90 glu→gly, 90 glu→lys, 91 leu→arg, 102 asn→thr, 102 asn→ser, 102 asn→tyr, 102 asn→lys, 107 gly→arg, 108 asn→asp, 108 asn→lys, 111 val→phe, 113 val→glu, 136 gly→asp, 21 asp→ala, 21 asp→ser, 45 phe→ala, 45 phe→thr, 45 phe→val, 63 his→phe, 63 his→trp, 66 lys→ser, 66 lys→asn, 67 val→phe, 67 val→trp, 67 val→ile, 70 ala→glu, 70 ala→ser, 70 ala→thr, 96 leu→phe, 96 leu→his, 96 leu→lys, 98 val→trp, 98 val→phe, 102 asn→asp, 102 asn→glu, 102 asn→arg, 102 asn→his, 102 asn→gly, 108 asn→arg, and 108 asn→glu, or (ii) one or more high affinity mutations selected from the group consisting of, 2 His→Arg, 20 Val→Met, 23 Val→Asp, 23 Val→Phe, 34 Val→Phe, 36 Pro→Thr, 37 Trp→Ser, 40 Arg→Lys, 40 Arg→Ser, 51 Pro→Arg, 51 Leu→His, 79 Asp→Gly, 79 Lys→Thr, 79 Lys→Met, 82 Lys→Asn, 89 Ser→Asn, 89 Ser→Arg, 94 Asp→His, 94 Asp→Asn, 96 Leu→Val, 97 His→Gln, 97 His→Leu, 99 Asp→Asn, 99 Asp→His, 99 Asp→Ala, 99 Asp→Tyr, 99 Asp→Gly, 99 Asp→Val, 100 Pro→Leu, 101 Glu→Lys, 101 Glu→Gly, 101 Glu→Asp, 103 Phe→Leu, 109 Val→Met, 121 Glu→Gln, 121 Pro→Gln, 121 Ala→Pro, 140 Ala→Thr, 142 Ala→Asp, 143 His→Arg, 143 His→Gln, 143 His→Pro, 144 Lys→Asn, 145 Tyr→His, 145 Tyr→Cys, 145 Tyr→Asp, 145 Tyr→Term, 146 His→Asp, 146 His→Pro, and 146 His→Leu; and/or (b) a Cys substitution at one or more sites selected from the group consisting of: asp79, his2, leu3, thr4, glu6, ser9, thr12, ala13, gly16, lys17, val18, asn19, val20, asp21, glu22, lys65, ser72, ala76, his77, asp79, asn80, gly83, ala86, thr87, glu90, lys95, lys59, glu43, ser44, asp47, ser49, thr50, ala53, asp52, lys61, glu121, lys120, thr123, lys66, asp73, ala62, his116, and his117.
 22. The polypeptide of claim 17 wherein at least one beta-globin-like domain includes the low affinity Presbyterian mutation, 108 Asn→Lys.
 23. The polypeptide of claim 19 wherein at least one beta-globin-like domain includes the high affinity providence mutation, 82 Lys→Asn. 